C0P8™ Instruction Set 
Performance Evaluation 



1.0 OVERVIEW 

National offers two families of 8-bit COP8 microcontrollers: 
Basic family and Feature family. The Feature family offers 
more on-chip peripheral and nine additional instructions 
compared to the Basic family. In this report, the COP8 Basic 
family instruction set performance is evaluated versus that 
of three competitive microcontrollers, the Motorola 
M68HC05, the Intel 80C51, and the Microchip PIC16C5X. 
The architecture, addressing modes, instruction sets and 
salient features of these microcontrollers are compared. 
Eight benchmark programs are developed, with each micro- 
controller programmed with full documentation for each of 
the benchmarks. Summary tables compare the results rela- 
tive to both code efficiency and execution time. 
The report examines only the instruction set efficiency and 
speed of execution of the selected microcontrollers. Factors 
related to on-chip hardware features (RAM/ROM sizes, in- 
terrupts, etc.) are not considered. Manufactures offer a vari- 
ety of options of hardware features, but the instruction set 
efficiency and execution speed typically remain identical 
across a manufacturer's microcontroller product line. 
When comparing a microcontroller to the competition, it is 
customary for manufacturers to select benchmark programs 
which particularly highlight the instruction set of their device. 
To avoid this phenomenon, general purpose commonly 
used benchmark routines were selected. 
One word of caution — this benchmark report, like all others 
of its kind, relies on a set of small program fragments. The 
operations performed by each of these fragments may or 
may not correspond to the operations required for a particu- 
lar application. When evaluating a microcontroller for a de- 
manding application, it is important to examine how an indi- 
vidual microcontroller will perform in that particular case. 

ARCHITECTURE 

Three of the four microcontrollers have a modified Harvard 
architecture, while a fourth (the Motorola M68HC05) has a 
Von Neumann architecture (named after John Von 
Neumann, an early pioneer in the computer field at Prince- 
ton). With a Von Neumann architecture, a CPU (Central Pro- 
cessing Unit) and a memory are interconnected by a com- 
mon address bus and a data bus. Positive aspects of this 
approach include convenient access to tables stored in 
ROM and a more orthogonal instruction set. The address 
bus is used to identify which memory location is being ac- 
ccessed, while the data bus is used to transfer information 
either from the CPU to the selected memory location or vice 
versa. Von Neumann was the first to realize that this archi- 
tectural model could have the memory serve as either pro- 
gram memory or data memory. In earlier computers (both 
electronic and electromechanical), the program store (often 
a programmed patchboard) had been completely separate 
from the data store (often a bank of vacuum tubes or re- 
lays). 
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The single address bus of the Von Neumann architecture is 
used sequentially to access instructions from program 
memory and then execute the instructions by retrieving data 
from and/or storing data in data memory. This means that 
instruction fetch cannot overlap data access from memory. 
A Harvard architecture (names after the Harvard Mark 1 and 
the early electromechanical computers developed at Har- 
vard by Howard Aiken — another computer pioneer) has sep- 
arate program memory and data memory with a separate 
address bus and data bus for each memory. One of the 
benefits of the Harvard architecture is that the operation of 
the microcontroller can be controlled more easily in the 
event of corrupted program counter. A modified Harvard ar- 
chitecture allows accessing data tables from program mem- 
ory. This is very important with modern day computers, 
since the program memory is usually ROM (Read Only 
Memory) while the data memory is usually RAM (Random 
Access Memory). Consequently, data tables usually need to 
be in ROM so that they are not lost when the computer is 
powered down. 

The obvious advantage of a Von Neumann architecture is 
the single address and single data bus linking memory with 
the CPU. A drawback is that code can be executed from 
data memory opening up the possibility for undesired opera- 
tion due to corruption of the program counter or other regis- 
ters. Alternatively, the advantage of a modified Harvard ar- 
chitecture is that instruction fetch and memory data trans- 
fers can be overlapped with a two stage pipeline, which 
means that the next instruction can be fetched from pro- 
gram memory while the current instruction is being executed 
using data memory. A drawback is that special instructions 
are required to access RAM and ROM data values making 
programming more difficult. 

The instructions which cause the "Modified" Harvard archi- 
tectures of the three microcontrollers are the instructions 
that provide a data path from program memory to the CPU. 
These instructions are listed below: 



COP8 


LAID 


Load Accumulator 
indirect from 
program memory 


80C51 


MOVC A, @A + DPTR 


Move Constant - 
Load Accumulator 
with a "fixed 
constant" from 
program memory 


PIC16C5X 


RETLW 


Return Literal to 
W from program 
memory 



TRI-STATE® is a registered trademark of National Semiconductor Corporation. 
COP8TM and WATCHDOGtm are trademarks of National Semiconductor Corporation. 
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C0P8 offers the single-byte LAID instruction which uses the 
contents of the accumulator to point to a data table stored 
in the program memory. The data accessed from the pro- 
gram memory is transferred to the accumulator. This in- 
struction can be used for table lookup operations and to 
read the entire program memory contents for checksum cal- 
culations. 

The 80C51 family offers a similar instruction. The 
MC68HC05 family has a Von Neumann architecture where 
CPU and program memory are interconnected by a com- 
mon address and data bus. 

Microchips's PIC16C5x family offers the RETLW instruction. 
To do table lookup, the table must contain a string of 
RETLW instructions. The first instruction just in front of the 
table of the RETLW instructions, calculates the offset into 
the table. The table can only be used as a result of a CALL. 
This instruction certainly does not offer the flexibility of the 
COP8 LAID instruction and cannot be used to perform a 
checksum calculation on the entire program memory con- 
tents. 

The use these instructions is demonstrated in the fourth 
benchmark program, which entails a three byte table search 
from a data table in program memory. 

2.0 SELECTED MICROCONTROLLERS 
COP8 

The COP8 has a modified Harvard architecture with memory 
mapped input/output. The CPU registers include an 8-bit 
accumulator (A), a 16-bit program counter (PC), two 8-bit 
data pointers (B and X), an 8-bit stack pointer (SP), an 8-bit 
processor status word (PSW), and an 8-bit control register 
(CNTRL). The data memory includes a bank of 16 registers 
(including the three pointers) which have special attributes. 
All RAM, I/O ports, and registers (except A and PC) are 
mapped into the data memory address space. The timer 
section includes a 16-bit timer, and an associated 16-bit au- 
toreload register. The generic COP8 I/O section includes 
two 8-bit I/O ports, each with an associated 8-bit configura- 
tion register and an associated 8-bit data register. The I/O 
section also contains one dedicated 8-bit output port with 
an associated 8-bit data register, one dedicated 8-bit input 
register, and one special purpose 8-bit I/O port with associ- 
ated 8-bit configuration register and 8-bit data register. 

M68HC05 

The M68HC05 has a Von Neumann architecture with mem- 
ory mapped input/output. The CPU registers include an 
8-bit accumulator (A), a 16-bit program counter (PC), an 8- 
bit index register (X), a stack pointer (SP), and an 8-bit con- 
dition code register (CCR). The timer section includes a 16- 
bit free running counter, two 16-bit counter registers to read 
the counter, a 16-bit timer input capture register, a 16-bit 
counter output compare register, an 8-bit timer control regis- 
ter, and an 8-bit timer status register. The I/O section in- 
cludes three bidirectional 8-bit I/O ports, each with an asso- 
ciated 8-bit data register and an 8-bit direction register. The 
I/O section also includes a 7-bit input port with an associat- 
ed input register. 



80C51 

The 80C51 has a modified Harvard architecture with memo- 
ry mapped input/output. The CPU registers include an 8-bit 
accumulator (A), an 8-bit auxilliary register (B) for multiply 
and divide, a 16-bit program counter (PC), and 8-bit program 
status word (PSW), an 8-bit stack pointer (SP), and a 16-bit 
data pointer (DPTR). The register bank consists of eight 
special working registers R0-R7. Registers R0 and R1 can 
be used as indirect address pointers to data memory. The 
timer section includes two 16-bit timers, an 8-bit timer mode 
register (TMOD), and an 8-bit timer control register (TCON). 
The I/O section includes four 8-bit I/O ports, each with an 
associated 8-bit register. 

PIC16C5X 

The PIC15C5X has a modified Harvard architecture with 
memory mapped input/output. The PIC16C5X also has a 
RISC (Reduced Instruction Set Computer) type architecture 
in that there are only 33 single word basic instructions. Actu- 
ally these 33 instructions should be expanded to a total of 
47 for comparison purposes with the other microcontrollers. 
This is necessary since 14 of the 33 instructions have a 
programmable destination bit, which selects one of two des- 
tinations for the result of the instruction. Consequently, each 
of these 14 instructions should be counted as dual instruc- 
tions. The CPU registers include an 8-bit working register 
(W) which serves as a pseudo accumulator in that it holds 
the second operand, receives the literal in the immediate 
type instructions, and also can be program selected as the 
destination register. However, a bank of 31 file registers 
serve as the primary accumulators in that they represent the 
first operand and also may be program selected as the des- 
tination register. The first eight file registers include the real 
time clock/counter register (RTCC) mapped as F1, the 9-bit 
program counter (PC) mapped as F2, the 8-bit status word 
register (SWR) mapped as F3, and the 8-bit I/O port regis- 
ters mapped as F5-F7. The 8-bit File Select Register (FSR) 
is mapped as F4, whose low order 5 bits select one of the 
31 file registers in the indirect addressing mode. Calling for 
file F0 in any of the file oriented instructions selects indirect 
addressing and will use the File Select Register (FSR). It 
should be noted that file register F0 is not a physically im- 
plemented register. The CPU also contains a two level 
12-bit hardware push/pop stack for subroutine linkage. The 
PIC16C5X also has a WATCHDOGtm timer as well as the 
real time clock/counter register (RTCC), but it does not 
have any hardware interrupts. Consequently, the counter 
register RTCC must be program sampled for any overflow. 
The number of instructions in a program must also be cali- 
brated differently with the PIC16C5X since the instruction 
word is 12 bits in length. Consequently, twenty 12-bit word 
instructions will contain the byte equivalent of thirty C0P8 
8-bit byte instructions. The larger size instruction word is 
instrumental in implementing the RISC architecture. It 
should also be noted that the upper members of the PIC 
family expand to having a 16-bit instruction word. 
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3.0 INSTRUCTION SET ANALYSIS 


68HC05 


Addressing Modes 


Total basic instructions: 62 




Total instructions including addressing modes: 210 


COP8 


1. Register/Memory 


1. Direct 


2. Read/Modify/Write 


2. Register Indirect 


3. Branch 


3. Register Indirect with Post Increment/Decrement 


4. Control 


4. Immediate 






80C51 


5. Immediate Short 






Total basic instructions: 51 


6. Indirect from Program Memory 


Total instructions including addressing modes: 1 1 1 


7. Jump Relative 


1. Arithmetic 


8. Jump Absolute 


2. Logical 

3. Data Transfer 


9. Jump Absolute Long 


10. Jump Indirect 


4. Boolean Variable 


68HC05 


5. Program Branching 


1. Inherent 


PIC16C5X 


2. Immediate 


Total basic instructions: 33 


3. Extended 


Total instructions with 14 dual destination instructions 


4. Direct 


counted: 47 


5. Indexed with no offset 


1 . Byte-oriented File Register 


6. Indexed with 8-bit offset 


2. Bit-oriented File Register 


7. Indexed with 16-bit offset 


3. Literal and Control 


8. Relative 


COP8 Instruction Set Features 


80C51 


1 . Majority of single byte opcode instructions to minimize 


1. Register 


program size. 


2. Direct 


2. One instruction cycle for the majority of single byte in- 


3. Indirect 


structions to minimize program execution time. 


4. Immediate 


3. Many single byte multiple function instructions such as 




DRSZ. 


5. Relative 


4. Three memory mapped pointers: Two data pointers (B 


6. Absolute 


and X) for register indirect addressing, and one pro- 


7. Long 


gram memory stack pointer (SP) for the software stack. 


8. Indexed 


5. Sixteen memory mapped registers which allow an opti- 


PIC16C5X 


mized implementation of certain instructions. 


1 . Data Direct 


6. Ability to set, reset, and test any individual bit in data 




memory address space, including the memory mapped 


2. Data Indirect 


I/O ports and associated registers. 


3. Immediate 


7. Register indirect LOAD and EXCHANGE instructions 


4. Program Direct 


with optional automatic post-incrementing or post-dec- 


5. Program Indirect 


rementing of the register pointers (Both B and X point- 


6. Relative 


ers). This allows for greater efficiency (both in through- 




put time and program code) in both loading and pro- 


Instruction Types 


cessing fields in data memory. 


COP8 


8. Unique instructions to optimize program size and 


Total basic instructions: 49 


throughput efficiency. Some of these instructions are: 


Total instructions including addressing modes: 87 


DRSZ IFBNE DCOR 


1. Arithmetic 


RETSK RRC LAID 


2. Load and Exchange 


9. Forty nine basic instructions. 


3. Logical 


10. Ten addressing modes provide great flexibility. 


4. Bit Manipulation 




5. Conditional 




6. Transfer of Control 
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M68HC05 Instruction Set Features 

1. Very flexible branch structure with 21 different branch 
instructions. 

2. Twelve different read/modify/write instructions. 

3. Five bit manipulation instructions (Clear, Set, Branch if 
Bit Clear, Branch if Bit Set, Bit Test Memory with Accu- 
mulator) provide great flexibility. 

4. Indexed addressing with options of no offset, 8-bit offset, 
or 1 6-bit offset. 

5. Data Tables located in page address space (0000- 
00FF) take advantage of the direct addressing mode for 
optimal code. 

6. Multiply instruction (unsigned, 8 X 8). 

7. Sixty two basic instructions. 

8. Eight addressing modes provide great flexibility. 

80C51 Instruction Set Features 

1 . Optimized for 8-bit control applications. 

2. Provides a variety of fast compact addressing modes 
for accessing data memory to facilitate operations on 
small data structures. 

3. Offers extensive support for one bit variables, allowing 
direct bit manipulation in control and logic systems that 
require Boolean processing. 

4. Eight working registers (R0-R7) selected by three bits 
allow a function code and register address to be com- 
bined in one single byte instruction. 

5. Register RO and R1 serve as indirect addressing data 
memory pointers. 

6. Four banks of working registers, with only one bank 
active at a time, permit fast and effective "context 
switching". 

7. Several register specific instructions referring to the ac- 
cumulator (A), the accumulator and auxiliary register 
(B), register pair (AB), the carry flag (C), the data point- 
er (DPTR), and the program counter (PC) provide great 
efficiency. 

8. Indexed addressing using a base register (either the 
program counter (PC) or the data pointer (DPTR) and 
an offset in the accumulator (A) provide great flexibility. 

9. Multiply and Divide instruction (using A and B registers). 

10. Fifty one basic instructions. 

1 1 . Eight addressing modes provide great flexibility. 

PIC16C5X Instruction Set Features 

1 . All single word (1 2-bit) instructions for compact code effi- 
ciency. 

2. All instructions are single cycle except the jump type in- 
structions (GOTO, CALL) and failed test instructions 
(DECFSZ, INCFSZ, BTFSC, BTFSS) which are two cy- 
cle. 

3. 32 File registers can be addressed directly or indirectly, 
and serve as accumulators to provide first operand, 
Working register (W) serves as pseudo accumulator, pro- 
viding second operand. 

4. Working register (W) also serves as destination for literal 
from program memory in MOVLW (Move Literal to W) 
and RETLW (Return Literal to W) instructions. 



5. Many instructions include a destination bit which se- 
lects either the register file or the accumulator as the 
destination for the result. 

6. Four bit manipulation instructions (Set, Clear, Test and 
Skip if Set, Test and Skip if Clear) provide great flexibili- 
ty. 

7. Status word register (SWR) memory mapped as regis- 
ter file F3 allows testing of status bits (carry, digit carry, 
zero, power down, and time-out). 

8. Program counter (PC) memory mapped as register file 
F2 allows W to be used as offset register for indirect 
addressing of program memory. 

9. Indirect addressing mode data pointer FSR (file select 
register) memory mapped as register file F4. Address- 
ing FO causes FSR to be used to select file register. 

10. Literal in RETLW (Return Literal to W) instruction com- 
bined with file register mapped program counter allow 
data tables to be accessed from program memory. 

11. Two level 12-bit push/pop hardware stack for subrou- 
tine linkage using the CALL and RETLW instructions. 

12. WATCHDOG timer. 

13. Thirty three basic instructions. 

14. Six addressing modes provide great flexibility. 

COMPARISON OF SALIENT INSTRUCTION SET 
FEATURES 

Addressing 

The three types of indexed addressing (no offset, single 
byte offset, and dual byte offset) in the M68HC05 provide 
great flexibility in walking across two or three data fields 
(two operands, result) simultaneously. The three other mi- 
crocontrollers achieve the same result with indirect address- 
ing and pointers. The COP8 and 80C51 both have dual 
pointers for indirect data addressing (B and X pointers in the 
COP8, RO and R1 in the 80C51). The PIC16C5X only has 
one indirect data pointer F4, which is called by using FO. 
Consequently, processing two multiple byte operands with 
the PIC16C5X requires alternate loading of the operand ad- 
dresses into the F4 pointer as the multiple byte data fields 
are processed. 

All four microcontrollers have some form of indirect ad- 
dressing for program memory, which is very useful in pro- 
ducing jump tables. The COP8 uses the JID (jump indirect) 
instruction, while the PIC16C51X can address the program 
counter as file register F2 and add an offset to it. The 
M68HC05 and 80C51 both use their indexed addressing 
modes to achieve indirect addressing of program memory. 
Both jump tables and lookup tables are easily created with 
indexed addressing. The COP8 uses the LAID (load accu- 
mulator indirect) instruction to implement lookup tables, 
while the PIC16C5X uses the RETLW (return literal to W) 
instruction as the table entries preceded by a table header 
which produces the table offset address. 
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Bit Manipulation 

All four microcontrollers have instructions to set, reset, and 
test individual bits in data memory. However, the 80C51 al- 
lows bit manipulation in only 210-bit addressable locations 
of data memory (16 general purpose byte locations 20-2F, 
and the rest in the special function registers including the 
I/O ports). The other three microcontrollers are capable of 
bit addressing anywhere in data memory. Three of the four 
microcontrollers have direct bit tests for a bit being either 
set or reset, while the COP8 test is for bit set only. The 
80C51 has two instructions (MOV C, bit and MOV bit, C) to 
either transfer a bit of data memory to or from the carry. This 
is very handy for saving a carry and restoring it for later use. 
The two bit test instructions (BRCLT and BRSET) in the 
M68HC05 also set the carry to the state of the bit tested. 
The 80C51 also has four logical (two logical AND, two logi- 
cal OR) bit manipulation instructions, with the carry and a 
selected memory bit serving as the two operands. These 
four instructions are described more fully with the logical 
instructions. 

Input/Output 

The four microcontrollers all have mulitple I/O ports, with 
some of them (COP8, M68HC05) also having dedicated in- 
put and/or dedicated output ports. The COP8 and 
M68HC05 both have configuration registers for each I/O 
port, with each configuration bit determining whether the 
associated port bit is selected as input or output. The 
PIC16C5X contains a TRISTATE® instruction (TRIS) whose 
operand determines whether or not each associated port bit 
is put in the TRISTATE condition to serve as an input pin. 
The 80C51 configures a port by writing ones to TRISTATE 
the port bit positions selected as inputs. Otherwise the as- 
sociated 80C51 port bit is selected as an output bit. The 
COP8 has three addresses associated with each I/O port. 
The first two addresses select the associated data and con- 
figuration register for the port, while the third address se- 
lects the actual port pins. If a COP8 port bit is configured as 
an input, then the associated bit in the data register selects 
whether the associated input is Hi-Z or weak pull-up. Input/ 
Output manipulation with the I/O ports is demonstrated in 
the fifth benchmark program. 

Increment/ Decrement 

All four micrcontrollers have accumulator increment and 
decrement instructions. The register file of the PIC16C5X 
serves as multiple accumulators. The COP8 has a DRSZ 
(decrement register and skip if zero) instruction for its six- 
teen registers located at data memory addresses 0F0-0FF. 
This register bank includes the B and X data memory point- 
ers and also the SP stack pointer. The PIC16C5X also has 
both INCFSZ (increment file and skip if zero) and DECFSZ 
(decrement file and skip if zero) instructions. The M68HC05 
increment and decrement instructions can be selected for 
the accumulator, the index register, or any memory location. 
The 80C51 increment and decrement instructions can be 
selected for the accumulator, the eight R registers including 
the RO and R1 data pointers, or any memory location (se- 
lected either directly or indirectly). The 80C51 also has a 
DJNZ (decrement and jump if not zero) instruction which 
can be selected for any of the eight R registers including the 
RO and R1 data pointers, or for any memory location select- 
ed with a relative address. 



Load Memory Immediate 

Only the COP8 and the 80C51 have instructions that can 
load memory with an immediate value. The memory location 
to be loaded can be selected with either direct or indirect 
addressing for both microcontrollers. 

Post Incrementation or Decrementation of Data 
Pointers 

Only the COP8 has instructions that can post increment, 
post decrement, or leave the data pointer unchanged. This 
feature applies to both the COP8 load and exchange in- 
structions, and can be used with either of the two pointers B 
and X. The feature is very useful when processing multiple 
byte fields, especially with two operands (additions, subtrac- 
tions) or an operand and a result (block move). Examples of 
the usage of this feature can be found in the first three 
COP8 benchmark programs. 

Loop Counting and Data Pointing Testing 

Three of the four microcontrollers (excluding M68HC05) 
have specific instructions to facilitate loop counting. The 
COP8 DRSZ (decrement register and skip if zero) instruc- 
tion tests one of sixteen registers (including the two data 
pointers B and X) and skips the next instruction (a branch 
back to loop) if the result is zero. The PIC16C5X DECFSZ 
(decrement file register and skip if zero) is analogous to the 
COP8 DRSZ instruction. The 80C51 DJNZ (decrement and 
jump if not zero) instruction combines both the test and 
jump in a single instruction. These three instructions (COP8 
DRSZ, PIC16C5X DECFSZ, 80C51 DJNZ) all operate on a 
loop counter, but uses the standard BNE (branch if not 
equal) branch instruction following the decrement to test if 
the counter is zero. The BNE instruction tests the M68HC05 
zero status flag which is active with the decrement instruc- 
tion. Only the COP8 contains a test data pointer uction (IF- 
BNE — If B pointer not equal). This instruction is used to 
sense the end of a program loop where multiple byte fields 
are being processed. 

The 80C51 CJNE (compare and jump if not equal) instruc- 
tion used in the immediate addressing mode provides an 
alternative method of loop counting. This instruction is anal- 
ogous to the COP8 IFBNE instruction but combines the test 
with the jump in a single instruction. 

It should be noted that neither the COP8 IFBNE instruction 
nor the 80C51 CJNE instruction require the use of a coun- 
ter, but rather depend on an end of field comparison for loop 
termination. Consequently these instructions are used in 
loops where a data pointer is being used to walk across a 
mulitple byte field. Examples of the usage of these two in- 
structions can be found in the first three benchmarks. 

Branch and Subroutine Call Instructions 

All four micrcontrollers have absolute address branching, 
and three of the four (excluding the PIC16C5X) also have 
relative address branching. Only the COP8 and the 
M68HC05 have single byte branching. The COP8 single 
byte branching is with relative addressing, where six bits 
provide relative addressing of -31 to +32. The M68HC05 
single byte branching is with no offset indexed addressing, 
where the index register X provides the absolute address. 
All of the myriad conditional branch instructions of the 
M68HC05 use two bytes, with the second bytes providing 
relative addressing of -127 to +128. 
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Three of the four microcontrollers have a software stack 
(excluding the PIC16C5X) where the return address is 
stored when a subroutine is called. Consequently, multiple 
levels of subroutine and interrupt nesting are possible with 
the software stack. The PIC1 6C5X has a two level hardware 
stack where the return address is stored with a subroutine 
call. Thus only two levels of subroutine nesting are possible 
with the PIC16C5X. The other three microcontrollers have 
absolute address subroutine calls, with the 80C51 also hav- 
ing a relative address call. Only the M68HC05 provides a 
single byte subroutine call, using no offset indexed address- 
ing (with the index register providing the absolute address. 

Return Instructions 

All four microcontrollers have return from subroutine instruc- 
tions and three of the four (excluding the PIC16C5X) have 
return from interrupt instructions. The PIC16C5X does not 
have an interrupt. 

The COP8 has two return from subroutine instructions, RET 
and RETSK. The RET is the normal return from subroutine 
instruction, while the RETSK instruction returns from the 
subroutine and then skips the next instruction. These two 
instructions are very useful in passing back flag information 
from the subroutine in lieu of actually using a flag such as 
the carry. This is demonstrated in the fourth benchmark (ta- 
ble search) where a flag needs to be passed back from the 
subroutine to indicate whether or not the search was suc- 
cessful. The COP8 simply uses the RETSK instruction to 
indicate a successful search, whereas each of the other 
three microcontrollers need to set up the carry as the flag. 
The PIC16C5X return from subroutine instruction is RETLW 
(return and place literal in W). The literal serves as an imme- 
diate data value from memory. The use of this instruction is 
demonstrated in the fourth benchmark (table search), where 
the program memory data table consists of a block of 
RETLW instructions preceded by the table header. 

Comparison and Conditional Branch Instructions 

Three of the four microcontrollers (excluding the PIC16C5X) 
have comparison instructions. The COP8 has two compari- 
son instructions, IFEQ (if equal) and IFGT (if greater than). 
These comparison instructions compare the accumulator 
versus an immediate value or a number from memory (se- 
lected with either direct or indirect addressing). The 
M68HC05 has two comparison instructions, CMP (compare) 
and BIT (bit test memory with accumulator). The CMP in- 
struction compares the accumulator versus an immediate 
value or a number from memory (selected with either direct 
or indexed addressing). The BIT instruction performs a logi- 
cal AND comparison between the accumulator and an im- 
mediate value or a number from memory (selected with ei- 
ther direct or indexed addressing). The 80C51 has one com- 
parison instruction CJNE (compare and jump if not equal) 
which is combined with a branch in the same instruction. All 
four microcontrollers have bit test instructions (including test 
carry.) 

The M68HC05 has a myriad of test conditions and branch 
instructions, including BCC (branch if carry clear), BCS 
(branch if carry set), BRCLR (branch if memory bit clear), 
BRSET (branch if memory bit set), BEQ (branch if equal), 
BHI (branch if higher), BHS (branch if higher or same), BLO 
(branch if lower), BLS (branch if lower or same), BMI 
(branch if minus), BNE (branch if not equal), and BPL 
(branch if plus). 



The 80C51 includes seven conditional branch instructions, 
consisting if JC (jump if carry), JNC (jump if no carry), JNZ 
(jump if not zero), JZ (jump if zero), JB (jump if memory bit 
set), JNB (jump if memory bit not set), and JBC (jump if 
memory bit set and clear bit). 

The COP8 includes seven conditional test instructions, 
IFBIT (test if memory bit is set), IFC (if carry), IFNC (if no 
carry), IFEQ (if equal), IFGT (if greater than), IFBNE (if B 
pointer not equal), and DRSZ (decrement register and skip if 
zero). These seven instructions will all cause the next in- 
struction to be skipped if the test is successful. The skipped 
instruction can either be a branch or some function (such as 
setting a bit flag or returning from a subroutine) that can be 
contained in one instruction. 

The PIC16C5X includes four test instructions, consisting of 
DECFSZ (decrement file register and skip if zero), INCFSZ 
(increment file register and skip if zero), BTFSC (bit test file 
register and skip if bit clear), and BTFSS (bit test file register 
and skip if bit set). These four instructions will all cause the 
next instruction to be skipped if the test is successful. 

Logical Instructions (AND, OR, Exclusive OR) 

All four microcontrollers have a full complement of the logi- 
cal instructions, with the accumulator and a selected memo- 
ry location (using either direct or indirect addressing) serving 
as the two operands. Three of the four microcontrollers (ex- 
cluding the PIC16C5X) can also perform the logical instruc- 
tions with the accumulator and an immediate value serving 
as the operands. The 80C51 contains four logical bit instruc- 
tions which perform either the logical AND or logical OR 
between the carry bit and a selected memory bit, with either 
polarity of the memory bit being selectable. The M68HC05 
contains a logical compare instruction, which performs a 
logical AND comparison between the accumulator and an 
immediate value or a number from memory (selected with 
either direct or indexed addressing). 

Shift and Rotate Instructions 

Only the M68HC05 has shift instructions, both logical left 
shift and logical right shift. All four microcontrollers have a 
right rotate instruction through carry (none bit loop), and 
three of the four (excluding the COP8) also have a left ro- 
tate through carry. The 80C51 also has both a right and left 
rotate direct (eight bit loop). 

Complement Instructions 

Three of the four microcontrollers (excluding the COP8) 
have a complement instruction for the accumulator. The 
80C51 also has two complement bit instructions, with the 
first being used to toggle the carry and the second to com- 
plement selected accumulator bits. 

BDC Decimal Correct 

Only the COP8 and the 80C51 provide instructions to expe- 
dite BCD processing. The DCOR (decimal correct) instruc- 
tion of the COP8 is used following an add or subtract in- 
struction to achieve the correct BCD result. Note that a hex 
66 must be added to the first operand to start the addition 
process. The DA (decimal adjust) instruction of the 80C51 is 
used following an add instruction to achieve the correct 
BCD result. The decimal adjust instruction does not work 
following subtraction. Consequently, BCD subtraction in the 
80C51 must be implemented by adding the complement of 
the subtrahend (second operand) and then using the deci- 
mal adjust instruction. BCD subtraction is demonstrated in 
the third benchmark program. 
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Exchange and Swap 

Only the COP8 and the 80C51 have exchange instructions 
between the accumulator and memory. Both microcontrol- 
lers can select either direct or indirect addressing for the 
associated exchange memory selection. The 80C51 also 
has a XCHD A, @Ri (exchange digit) instruction which ex- 
changes the low order nibble (digit) of the accumulator with 
the low order nibble of the indirectly addressed memory lo- 
cation selected. 

All of the microcontrollers except the M68HC05 have a 
SWAP instruction which interchanges the low and high or- 
der nibbles of the accumulator. With the PIC16C5X, the 
SWAPF instruction can be selected for any of the register 
bank accumulators. 

Push and Pop Instructions 

Only the 80C51 has PUSH and POP instructions to augment 
operations with the software stack. Consequently, only the 
80C51 has the ability to store temporary data in the soft- 
ware stack as well as subroutine and interrupt return ad- 
dresses. 

4.0 BENCHMARK PROGRAMS 

1. FIVE BYTE BLOCK MOVE 

This benchmark program moves a block of five data byte 
from one location to another. 

2. FOUR BYTE BINARY ADDITION 

This benchmark subroutine adds two four byte binary num- 
bers and replaces the first operand with the result, like an 
adding machine (A + B replaces A). The carry flag is used 
to indicate an overflow. 

3. FOUR BYTE PACKED BCD SUBTRACTION 

This benchmark subroutine subtracts two eight digit packed 
BCD numbers and replaces the first operand with the result, 
like an adding machine. (A - B replaces A). The carry flag 
is used to indicate a negative result. 

4. THREE BYTE TABLE SEARCH 

This benchmark subroutine searches a program memory 
data table for a three byte match. A flag indicates whether 
or not the search was successful, with the address of the 
first byte of a matched string being returned. 



5. INPUT/OUTPUT MANIPULATION 

This benchmark compares two 8-bit I/O ports P1 and P2. If 
they are equal, a nine is output as the least significant digit 
of a third port P3. If P1 is greater than P2, then P2 data is 
output on P1 . If P1 is < P2, then the higher order digit of P1 
is copied to the lower order digit position of P3. 

6. SERIAL INPUT/OUTPUT WITH OFFSET TABLE 

This benchmark subroutine inputs a sequence of byte cou- 
plets, each of which consists of an address followed by a 
data byte. The address is used to access a data table to 
produce an offset value which is added to the data byte. 
The updated data byte is output while the next couplet's 
address byte is input. 

7. TIMEKEEPING 

This timekeeping benchmark program emulates a real time 
clock, keeping track of hours, minutes, and seconds in 
packed BCD format. The program is interrupt driven, using a 
timer interrupt. 

8. SWITCH ACTIVATED FIVE SECOND LED 

This benchmark samples a switch input to activate a five 
second output for turning on an LED. The switch is de- 
bounced with a 50 ms program delay both on opening and 
closure. 

5.0 BENCHMARK DATA 

This section provides the benchmark programs for each mi- 
crocontroller. For each instruction, the byte count and in- 
struction cycle count is given. The total cycle count is multi- 
plied by the fastest cycle time of the selected microcontrol- 
ler to yield the total benchmark execution time. 



Microcontrollers 


Instruction Cycle Time 


XTAL 


COP8 


1.0 ,us 


10 MHz 


M68HC05 


0.5 jas 


4 MHz 


80C51 


1.0 ,us 


12 MHz 


PIC16C5X 


0.5 jas 


8 MHz 
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COP8 BENCHMARK #1— FIVE BYTE BLOCK MOVE 

This benchmark moves only a block of five data bytes from a specific source location to a specific destination location. 



MOVE: 



LOOP: 



LD 

LD 

LD 

X 

IEBNE 

JP 



B, #14 
X, #50 
A, [B+] 
A, [X+] 
#3 
LOOP 



1/1 Source address to B pointer 

2/3 Destination address to X pointer 

1/2 Load A with source byte 

1/3 Source byte to destination 

1/1 Test (modulo 16) if block move finished (19-16 = 3) 

1/3 Loop back if not finished 



7 BYTES 
47 CYCLES 

68HC05 BENCHMARK #1— FIVE BYTE BLOCK MOVE 

This benchmark moves only a block of five data bytes from a specific source location to a specific destination location. 



2/2 Load X with block length 

2/4 Load A with source byte (start at block top) 

2/5 Store A at destination (start at block top) 

1/3 Decrement X 

2/3 Loop back if block transfer not finished 

9 BYTES 
76 CYCLES 



MOVE: 


LDX 


#5 


LOOP: 


LDA 


24, X 




STA 


54, X 




DECX 






BNE 


LOOP 
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80C51 BENCHMARK #1— FIVE BYTE BLOCK MOVE 

This benchmark moves only a block of five data bytes from a specific source location to a specific destination location. 



MOVE : MOV 
MOV 

LOOP: MOV 
MOV 
INC 
INC 
CJNE 



RO, #20 

Rl, #50 

A, @R0 

@R1, A 

Rl 

RO 

RO, #25, LOOP 



2/1 Load RO pointer with source address 

2/1 Load Rl pointer with destination address 

1/1 Load A with source byte 

1/1 Store source byte at destination 

1/1 Increment destination pointer 

1/1 Increment source pointer 

3/2 Compare & loop back if block transfer not finished 

11 BYTES 
32 CYCLES 



PIC16C5X BENCHMARK #1— FIVE BYTE BLOCK MOVE 

This benchmark moves only a block of five data bytes from a specific source location to a specific destination location. 



Source @ F20-F24 Destination @ F25-F29 

Load W with length of block 

Store block length in CNIR 

Load W with address of source 

Source address to W 

Source address to F4 (indirect pointer) 

Source byte to W 

Source byte to TEMP 

Destination/source address difference to W 

Add to pointer to get destination address 

Source byte to W 

Source byte to destination 

Increment BASE 

Test if block move finished 

Loop back if not finished 



MOVE: 



LOOP: 



TEMP 


EQU F29 




BASE 


EQU F30 




CNTR 


EQU F31 




MOVLW 


5 




1/1 


MOVWF 






1/1 


MOVLW 


20 




1/1 


MOVF 


BASE, 


W 


1/1 


MOVWF 


F4 




1/1 


MOVF 


FO, W 




1/1 


MOVWF 


TEMP 




1/1 


MOVLW 


5 




1/1 


ADDWF 


F4 




1/1 


MOVF 


TEMP, 


w 


1/1 


MOVWF 


FO 




1/1 


INCF 


BASE 




1/1 


DECFSZ 


CNTR 




1/1 


GOTO 


LOOP 




1/2 



14 WORDS (Equivalent to 21 BYTES) 
62 CYCLES 
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ADDITION: 


LD 




LD 




RC 


LOOP: 


LD 




ADC 




X 




IFBNE 




JP 




RET 



B, 


#10 


X, 


#20 


A, 


[X+] 


A, 


[B] 


A, 


[B+] 


#14 


LOOP 



COP8 BENCHMARK #2— FOUR BYTE BINARY ADDITION 

This benchmark adds two four byte binary numbers and replaces the first operand with the result. This emulates an adding 
machine addition, where A + B replaces A. The benchmark is programmed as a subroutine, with the carry flag indicating an 
overflow. 



1/1 Set up address of 1st operand in B pointer 

2/3 Set up address of 2nd operand in X pointer 

1/1 Reset carry 

1/3 Load 2nd operand to A 

1/1 Add 1st operand to 2nd operand 

1/2 Result replaces 1st operand 

1/1 Test if addition finished 

1/3 Loop back if not finished 

1/5 Return from subroutine 



10 BYTES 
48 CYCLES 

M68HC05 BENCHMARK #2— FOUR BYTE BINARY ADDITION 

This benchmark adds two four byte binary numbers and replaces the first operand with the result. This emulates an adding 
machine addition, where A + B replaces A. The benchmark is programmed as a subroutine, with the carry flag indicating an 
overflow. 



2/2 Load index register with 1st operand address 

1/2 Clear carry 

1/3 Load A with 1st operand 

2/4 Add 2nd operand to A 

1/4 Replace 1st operand with result 

1/3 Increment index register 

2/2 Compare X with end of field 

2/3 Loop back if addition not finished 

1/6 Return from subroutine 

13 BYTES 
85 CYCLES 



ADDITION: 


LD 
CLC 


X, # 


LOOP: 


LDA 


X 




ADC 


4, X 




STA 


X 




INCX 






CPX 


#20 




BNE 


LOOP 




RTS 
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80C51 BENCHMARK #2— FOUR BYTE BINARY ADDITION 

This benchmark adds two four byte binary numbers and replaces the first operand with the result. This emulates an adding 
machine addition, where A + B replaces A. The benchmark is programmed as a subroutine, with the carry flag indicating an 
overflow. 



2/1 Load Rl pointer with address of 2nd operand 

2/1 Load RO pointer with address of 1st operand 

1/1 Clear carry 

1/1 Load 1st operand to A 

1/1 Add 2nd operand to A 

1/1 Replace 1st operand with result 

1/1 Increment 2nd operand pointer 

1/1 Increment 1st operand pointer 

3/2 lest if finished and loop back if not 

1/2 Return from subroutine 

14 BYTES 
33 CYCLES 



PIC16C5X BENCHMARK #2— FOUR BYTE BINARY ADDITION 

This benchmark adds two four byte binary numbers and replaces the first operand with the result. This emulates an adding 
machine addition, where A + B replaces A. The benchmark is programmed as a subroutine, with the carry flag indicating an 
overflow. 



ADDITION: MOV 
MOV 
CLR 

LOOP: MOV 

ADD 

MOV 

INC 

INC 

CJNE 

RET 



Rl, #20 

RO, #16 

C 

A, @R0 

A, @R1 

@R0, A 

Rl 

RO 

RO, #20, LOOP 



STATUS 


EQU F3 




BASE 


EQU F26 




CNTR 


EQU F25 




TEMP 


EQU F24 




ADDITION: MOVLW 


4 


1/1 


MOVWF 


CNTR 


1/1 


MOVLW 


20 


1/1 


MOWWF 


BASE 


1/1 


BCF 


STATUS, 


1/1 


LOOP : MOVF 


BASE, W 


1/1 


MOVWF 


F4 


1/1 


MOVF 


FO, W 


1/1 


MOVWF 


TEMP 


1/1 


MOVLW 


4 


1/1 


SUBWF 


F4 


1/1 


MOVF 


TEMP, W 


1/1 


ADDWF 


FO 


1/1 


MOVLW 


4 


1/1 


ADDWF 


F4 


1/1 


INCF 


BASE 


1/1 


DECFSZ 


CNTR 


1/1 


GOTO 


LOOP 


1/2 


RETLW 





1/2 



Status register 

1st operand @ F16-F19 2nd operand @ F20-F23 



Load W with byte count (length of field) 

Store byte count in CNTR 

Load W with address of 2nd operand 

Store address of 2nd operand in BASE 

Clear carry (carry is bit of F3 register) 

2nd operand address to W 

2nd operand address to F4 (indirect pointer) 

2nd operand to W 

2nd operand to TEMP 

Byte count to W 

Subtract from pointer to get 1st operand address 

2nd operand to W 

Add 2nd operand to 1st operand 

Byte count to W 

Add to pointer to restore 2nd operand address 

Increment BASE 

Test if addition finished 

Loop back if addition not finished 

Return from subroutine 



19 WORDS (Equivalent to 28 1/2 BYTES) 
62 CYCLES 



11 
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COP8 BENCHMARK #3— FOUR BYTE PACKED BCD SUBTRACTION 

This benchmark subtracts two eight digit packed BCD numbers (four bytes each) and replaces the minuend (first operand) with 
the result. This emulates an adding machine subtraction, where A - B replaces A. The benchmark is programmed as a subrou- 
tine, with the carry flag being used to indicate a positive or negative result (carry set for negative result). The BCD decimal 
correct (DCOR) command is used following the subtraction to achieve the correct BCD result. 



BCDSUBI: 



LOOP: 



NEGR: 



LD 

LD 

SC 

LD 

X 

SUBC 

DCOR 

X 

IFBNE 

JP 

IFNC 

JP 

RC 

RET 

SC 

LD 

CLR 

SUBC 

DCOR 

X 

IFBNE 

JP 

SC 

RET 



#14 
#20 



A, 

A, 

A, 

A 

A, 

#2 

LOOP 

NEGR 



[X+] 
[B] 

[B] 

[B+] 



A 
A, 
A 
A, 

#2 
LUP 



#14 
[B] 
[B+] 



1/1 Set up address of 1st operand In B pointer 

2/3 Set up address of 2nd operand in X pointer 

1/1 Set carry for no input borrow to subtraction 

1/3 Load 2nd operand to A 

1/1 Exchange 1st and 2nd operands 

1/1 Subtract 2nd from 1st operand 

1/1 Decimal correct result of subtraction 

1/2 Result replaces 1st operand 

1/1 Test (modulo 16) if subtraction finished (18-16=2) 

1/3 Loop back if not finished 

1/1 Test if result negative (borrow=no carry) 

1/3 Jump if result negative 

1/1 Reset carry to indicate positive result 

1/5 Return from subroutine 

1/1 Set carry for no input borrow to subtraction 

1/1 Reinitialize B pointer to start of result 

1/1 Clear A to set up subtraction from zero 

1/1 Subtract result from zero 

1/1 Decimal correct new result 

1/2 Store new result 

1/1 Test (modulo 16) if subtraction finished (18-16=2) 

1/3 Loop back if not finished 

1/1 Set carry to indicate negative result 

1/5 Return from subroutine 

25 BYTES 
97 CYCLES 
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M68HC05 BENCHMARK #3— FOUR BYTE PACKED BCD SUBTRACTION 


This benchmark subtracts two eight digit packed BCD numbers (four bytes each) and replaces the minuend (first operand) with 
the result. This emulates an adding machine subtraction, where A - B replaces A. The benchmark is programmed as a subrou- 
tine, with the carry flag being used to indicate a positive or negative result (carry set for negative result). Note in the M68HC05 
subtraction that the carry represents the borrow, unlike some microcontrollers where the carry represents the absence of 
borrow in subtraction. The M68HC05 does not have any decimal correct or decimal adjust instructions, so the BCD correction 
must be program implemented. The correction algorithm for BCD subtraction consists of subtracting six whenever a nibble 
borrow occurs from the BCD subtraction result for each digit. 




TEMP 


EQU $91 








FLAG 


EQU $92 






BCDSUBT 


: JSR 


WIT 


2/5 


Call initialization subroutine 


LOOP: 


LDA 


X 


1/3 


Load A with 1st operand 




SBC 


4, X 


2/4 


Subtract 2nd operand from A 




JSR 


CORRECT 


2/5 


Call correction subroutine 




BNE 


LOOP 


2/3 


Loop back if subtraction not finished 




BCS 


NEGR 


2/3 


Branch to NEGR (neg. result) if carry- 




RTS 




1/6 


Return from subroutine 


NEGR: 


JSR 


WIT 


2/5 


Call initialization subroutine 


LUP: 


CLRA 




1/3 


Clear A 




SBC 


X 


1/3 


Subtract result of previous subtraction from zero 




JSR 


CORRECT 


2/5 


Call correction subroutine 




BNE 


LUP 


2/3 


Loop back if subtraction from zero not finished 




SEC 




1/2 


Set carry to indicate negative result 




RTS 




1/6 


Return from subroutine 


WIT: 


LD 


X, #16 


2/2 


Load index register with 1st operand address 




CLC 




1/2 


Clear carry (reset borrow) 




CLRA 




1/2 


Clear A 




STA 


FLAG 


2/4 


Clear FLAG register 




RTS 




1/6 


Return from initialization subroutine 


CORRECT 


: STA 


X 


1/4 


Replace 1st operand with result 




CLRA 




1/3 


Clear A 




BCC 


BYP1 


2/3 


Branch if carry set (no borrow) 




BSET 


0, FLAG 


2/5 


Set flag if carry (save carry value) 




LDA 


#$60 


2/2 


Load A with packed BCD 60 


BYP1: 


BHCC 


BYP2 


2/3 


Branch if half carry set (no half borrow) 




ADD 


#$06 


2/2 


Add packed BCD 06 to A 


BYP2: 


STA 


TEMP 


2/4 


Store A in TEMP 




LDA 


X 


1/3 


Load A with result of first subtraction 




SUB 


A, TEMP 


2/3 


Subtract TEMP from previous result 




BRSET 


0, FLAG, NXT 


3/5 


Restore carry (flag value to carry) 


NXT: 


STA 


X 


1/4 


Store new result 




INCX 




1/3 


Increment index register 




CPX 


#20 


2/2 


Compare X with end of field 




RTS 




1/6 


Return from correction subroutine 






C 


4 BYTES 






C 


67 CYCLES 
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80C51 BENCHMARK #3— FOUR BYTE PACKED BCD SUBTRACTION 


This benchmark subtracts two eight digit packed BCD numbers (four bytes each) and replaces the minuend (first operand) with 


the result. This emulates an adding machine subtraction, where A - B replaces A. The benchmark is programmed as a subrou- 


tine, with the carry flag being used to indicate a positive or negative result (carry set for negative result). It should be noted in the 


80C51 subtraction that the carry represents the borrow, unlike some microcontrollers where the carry represents the absence of 


borrow in subtraction. 






The BCD decimal adjust (DA) command or 


ly works following addition, not subtraction. Consequently, the BCD subtraction must 


be implemented as 


an addition by adding 


the com 


plement of the subtrahend (2nd operand) to the 1st operand. This comple- 


ment is achieved by subtracting the subtrahend from a packed BCD 99 (binary 10011001), and then adding one to the result. 


The following are examples of this complementation procedure: 


1.61-49 = 61 + 


(99-49) + 1 = 61 + 50 + 


1=12 (mod 100) (Output borrow shows positive result) 


2. 49-61 = 49 + 


(99-61) + 1 = 49 + 38 + 


1 = 88 (No output borrow indicates negative result) 


Negative result correction: - (99-88 4 


D = - 


-12 


BCDSUBI: MOV 


R2, #4 


2/1 


Load R2 with byte count 


MOV 


Rl, #20 


2/1 


Load Rl with address of 2nd operand 


MOV 


RO, #16 


2/1 


Load RO with address of 1st operand 


CLR 


C 


1/1 


Clear carry for no input borrow to subtraction 


LOOP: MOV 


A, #099H 


2/1 


Load accumulator with packed BCD 99 


SUBB 


A, §R1 


1/1 


Subtract 2nd operand from packed BCD 99 


ADD 


A, #01 


2/1 


Add one to result 


DA 


A 


1/1 


Decimal correct result 


ADD 


A, §R0 


1/1 


Add 1st operand to result 


DA 


A 


1/1 


Decimal correct new result 


MOV 


@R0, A 


1/1 


Replace 1st operand with result 


INC 


Rl 


1/1 


Increment 2nd operand pointer 


INC 


RO 


1/1 


Increment 1st operand pointer 


DJNZ 


R2, LOOP 


2/2 


Test if finished and loop back if not 


JC 


NEGR 


2/2 


Test and jump if negative result 


RET 




1/2 


Return from subroutine (no carry: positive result) 


NEGR : CLR 


C 


1/1 


Clear carry for no input borrow to subtraction 


MOV 


RO, #16 


2/1 


Load RO with address of result 


LUP : MOV 


A,#099H 


2/1 


Load accumulator with packed BCD 99 


SUBB 


A, @R0 


1/1 


Subtract result from packed BCD 99 


ADD 


A, #01 


2/2 


Add one to result 


DA 


A 


1/1 


Decimal correct result 


MOV 


@R0, A 


1/1 


Store result 


INC 


RO 


1/1 


Increment pointer 


CJNZ 


RO, #20, LUP 


3/2 


Test if finished and loop back if not 


SEIB 


C 


1/1 


Set carry to indicate negative result 


RET 




1/2 


Return from subroutine (carry: negative result) 




- 


9 BYTES 




c 


1 CYCLES 
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PIC16C5X BENCHMARK #3— FOUR BYTE PACKED BCD SUBTRACTION PAGE 1 OF 2 


This benchmark subtracts two eight digit packed BCD numbers (four bytes each) and replaces the minuend (first operand) with 
the result. This emulates an adding machine subtraction, where A - B replaces A. The benchmark is programmed as a subrou- 
tine, with the carry flag being used to indicate a positive or negative result (carry set for negative result). The PIC1 6C5X does not 
have any decimal adjust or decimal correct instructions, so the BCD correction must be program implemented. The correction 
algorithm for BCD subtraction consists of subtracting six whenever a nibble borrow occurs from the BCD subtraction result for 
each digit. 




STATUS 


EQU F3 




Status register 




FLAG 


EQU F27 








BASE 


EQU F26 




1st operand @ F16-F19 2nd operand @ F20-F23 




CNTR 


EQU F25 








TEMP 


EQU F24 






BCDSUBT 


: MOVLW 


4 


1/1 


Load W with byte count (length of field) 




MOVWF 


CNTR 


1/1 


Store byte count in CNTR 




MOVLW 


20 


1/1 


Load W with address of 2nd operand 




MOVWF 


BASE 


1/1 


Store address of 2nd operand in BASE 




BSF 


STATUS, 


1/1 


Set carry (reset borrow) (STATUS register bit 0) 


LOOP: 


MOVF 


BASE, W 


1/1 


2nd operand address to W 




MOVWF 


F4 


1/1 


2nd operand address to F4 (indirect pointer) 




MOVF 


FO, W 


1/1 


2nd operand to W 




MOVWF 


TEMP 


1/1 


2nd operand to TEMP 




MOVLW 


4 


1/1 


Byte count to W 




SUBWF 


F4 


1/1 


Subtract from pointer to get 1st operand address 




CALL 


CORRECT 


1/2 


Call CORRECT subroutine 




MOVLW 


4 


1/1 


Byte count to W 




ADDWF 


F4 


1/1 


Add to pointer to restore 2nd operand address 




DECFSZ 


CNTR 


1/1 


Test if subtraction finished 




GOTO 


LOOP 


1/2 


Loop back if subtraction not finished 




BTFSS 


STATUS, 


1/1 


Test if output borrow (no carry) 




GOTO 


NEGR 


1/2 


Branch to NEGR (neg. result) if output borrow 




BCF 


STATUS, 


1/1 


Clear carry to indicate positive result 




RETLW 





1/2 


Return from subroutine 


NEGR: 


MOVLW 


4 


1/1 


Load W with byte count (length of field) 




MOVWF 


CNTR 


1/1 


Store byte count in CNTR 




MOVLW 


16 


1/1 


Load W with address of result (old 1st operand) 




MOVWF 


BASE 


1/1 


Store address of result in BASE 




BSF 


STATUS, 


1/1 


Set carry (reset borrow) 


LUP: 


MOVF 


BASE, W 


1/1 


Result address to W 




MOVWF 


F4 


1/1 


Result address to F4 (indirect pointer) 




MOVF 


FO, W 


1/1 


Result to W 




MOVWF 


TEMP 


1/1 


Result to TEMP 




CLRF 


FO 


1/1 


Zero to result (old 1st operand) 




CALL 


CORRECT 


1/2 


Call CORRECT subroutine 




DECFSZ 


CNTR 


1/1 


Test if subtraction finished 




GOTO 


LUP 


1/2 


Loop back if subtraction not finished 




BSF 


STATUS, 


1/1 


Set carry to indicate negative result 




RETLW 





1/2 


Return from subroutine 






35 WORDS 



15 
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PIC16C5X BENCHMARK #3— FOUR BYTE BCD SUBTRACTION (Continued) 



CORRECT: 


CLRF 


FLAG 




1/1 




MOVF 


TEMP, W 




1/1 




SUBWF 


FO 




1/1 




CLRF 


TEMP 




1/1 




BTFSC 


STATUS, 





1/1 




GOTO 


BYP1 




1/2 




MOVLW 


60H 




1/1 




ADDWF 


TEMP 




1/1 


BYP1: 


BSF 


FLAG, 




1/1 




BTFSC 


STATUS, 


1 


1/1 




GOTO 


BYP2 




1/2 




MOVLW 


06H 




1/1 




ADDWF 


TEMP 




1/1 


BYP2: 


MOVF 


TEMP, W 




1/1 




BSF 


STATUS, 





1/1 




SUBWF 


FO 




1/1 




BTFSC 


FLAG, 




1/1 




BCF 


STATUS, 





1/1 




INCF 


BASE 




1/1 




REILW 





2 


1/2 

WORDS 



Clear FLAG register 

Move TEMP to W 

Subtract W from register 

Clear TEMP 
Test if carry (no borrow) 
Branch if carry (no borrow) 
Load W with packed BCD 60 
Add W to TEMP 
Carry to FLAG bit 

Test if digit carry (no digit borrow) 
Branch if digit carry (no digit borrow) 
Load W with packed BCD 06 
Add W to TEMP 
Move TEMP to W 
Set carry (reset borrow) 
Subtract W from previous result 
Test if FLAG bit reset 
Clear carry (set borrow) 
Increment BASE 
Return from CORRECT subroutine 



TOTAL: 



55 WORDS (Equivalent to 82 1/2 BYTES) 
274 CYCLES 
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C0P8 BENCHMARK #4— THREE BYTE TABLE SEARCH 

This benchmark searches a 200 byte table (resident in program memory) for a three byte character string, which may be 
resident anywhere in the lookup table (not necessarily on three byte boundaries). The type of return from subroutine (RETSK 
versus RET) indicates the success or failure of the search, with the address of the first byte of a matched string being returned in 
BASE. The benchmark is programmed as a subroutine, which should be located following the table since the LAID instructions 
in the subroutine must be located in the same 256 byte page of program memory. 



Three bytes of character string 
resident in registers 00, 01, 02 



Address of SIZE to B pointer 

Table size (200) minus 2 to SIZE 

Table base address of to BASE 

1st byte of table address to A 

Save 1st byte address in TEMP 

Restore 1st byte table address to A 

Load 1st of 3 test bytes from prog, memory table 

Test if 1st byte match 

First byte match 

Fail if mismatch 

Restore 1st byte address to A 

Increment address to get 2nd byte table address 

Save 2nd byte address in TEMP 

Restore 2nd byte table address to A 

Load 2nd of 3 test bytes from prog, memory table 

Test if 2nd byte match 

Second byte match 

Fail if mismatch 

Restore 2nd byte address to A 

Increment address to get 3rd byte table address 

Save 3rd byte address in TEMP 

Restore 3rd byte table address to A 

Load 3rd of 3 test bytes from prog, memory table 

Test if 3rd byte match 

RETURN and SKIP if three byte match found 

Decrement B pointer to select BASE 

Restore 1st byte address to A from BASE 

Increment 1st byte address 

Save new 1st byte address in BASE 

Decrement SIZE and skip if table search finished 

Continue table search 

RETURN if three byte match not found 



*First search iteration fails with first byte 42 BYTES 
mismatch, second search iteration successful 77 CYCLES* 





CHAR1 


= 00 






CHAR2 


= 01 






CHAR 3 


= 02 






SIZE = 


0F0 






BASE = 


0F1 






TEMP = 


0F2 




TBLSRCH: 


LD 


B, #SIZE 


2/3 




LD 


[B+], #198 


2/2 




LD 


[B], #0 


2/2 


SEARCH : 


LD 


A, [B+] 


1/2 




X 


A, [B] 


1/1 




LD 


A, [B] 


1/1 




LAID 




1/3 




IFEQ 


A, CHAR1 


3/4 




JP 


SEARCH2 


1/3 




JP 


FAIL 


1/3 


SEARCH2 : 


LD 


A, [B] 


1/1 




INC 


A 


1/1 




X 


A, [B] 


1/1 




LD 


A, [B] 


1/1 




LAID 




1/3 




IFEQ 


A, CHAR2 


3/4 




JP 


SEARCH3 


1/3 




JP 


FAIL 


1/3 


SEARCH3 : 


LD 


A, [B] 


1/1 




INC 


A 


1/1 




X 


A, [B] 


1/1 




LD 


A, [B] 


1/1 




LAID 




1/3 




IFEQ 


A, CHAR3 


3/4 


SUCCESS: 


RETSK 




1/5 


FAIL: 


LD 


A, [B-] 


1/2 




LD 


A, [B] 


1/1 




INC 


A 


1/1 




X 


A, [B] 


1/1 




DRSZ 


SIZE 


1/3 




JMP 


SEARCH 


2/3 




RET 




1/5 
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68HC05 BENCHMARK #4— THREE BYTE TABLE SEARCH 

This benchmark searches a 200 byte table (resident in program memory) for a three byte character string, which may be 
resident anywhere in the lookup table (not necessarily on three byte boundaries). The status of the carry bit indicates the 
success or failure of the search, with the address of the first byte of a matched string being returned in BASE. The benchmark is 
programmed as a subroutine. 



2/2 Set up table size (200) minus 2 

2/4 Save table size 

1/3 Set up table base address of 

2/4 Save table base address 

2/4 Get first byte from table 

2/3 Compare 1st byte with CHAR1 

2/3 Fail 

1/3 Set up address of 2nd byte from table 

2/4 Get second byte from table 

2/3 Compare 2nd byte with CHAR2 

2/3 Fail 

1/3 Set up address of 3rd byte from table 

2/4 Get third byte from table 

2/3 Compare 3rd byte with CHAR3 

2/3 Fail 

1/2 Set carry to indicate three byte match found 

1/6 Return 

2/3 Restore base address 

1/3 Increment base address to get new 1st byte address 

2/4 Save new base address 

2/3 Compare new base address with table size 

2/3 Continue table search 

1/2 Reset carry to indicate no three byte match found 

1/6 Return 



40 BYTES 
80 CYCLES* 

*First search iteration fails with first byte mismatch, second search iteration successful 



TBLSRCH: 


LDA 


#198 




STA 


SIZE 




CLX 






STX 


BASE 


SEARCH : 


LDA 


X 




CMP 


CHAR1 




BNE 


FAIL 




INCX 






LDA 


X 




CMP 


CHAR2 




BNE 


FAIL 




INCX 






LDA 


X 




CMP 


CHAR3 




BNE 


FAIL 


SUCCESS: 


SEC 
RTS 




FAIL: 


LDX 


BASE 




INCX 






STX 


BASE 




CPX 


SIZE 




BNE 


SEARCH 




CLC 






RTS 
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80C51 BENCHMARK #4— THREE BYTE TABLE SEARCH 

This benchmark searches a 200 byte table (resident in program memory) for a three byte character string, which may be 
resident anywhere in the lookup table (not necessarily on three byte boundaries). The status of the carry bit indicates the 
success or failure of the search, with the address of the first byte of a matched string being returned in DPTR. The benchmark is 
programmed as a subroutine. 



TBLSRCH: 


MOV 




MOV 


SEARCH : 


CLR 




MOVC 




CJNE 




MOV 




MOVC 




CJNE 




MOV 




MOVC 




CJNE 


SUCCESS: 


SET 




RET 


FAIL: 


INC 




DJNZ 




CLR 




RET 



DPTR, #TBLBASE 

R2, #198 

A 

@A + DPTR 
CHAR1, FAIL 

#1 

@A + DPTR 

CHAR2, FAIL 

#2 

@A + DPTR 

CHAR3, FAIL 



3/2 Set up table starting address 

2/1 Set up table size (200) minus 2 

1/1 Clear accumulator 

1/2 Get 1st byte from program memory table 

3/2 Compare 1st byte with CHAR1 

2/1 Load accumulator with offset of 1 

1/2 Get 2nd byte from table 

3/2 Compare 2nd byte with CHAR2 

2/1 Load accumulator with offset of 2 

1/2 Get 3rd byte from table 

3/2 Compare 3rd byte with CHAR3 

1/1 Set carry to indicate three byte match found 

1/2 Return from subroutine 

1/2 Increment data pointer to get new 

1st byte address 

2/2 Decrement size and test if 

table search finished 

1/1 Clear carry to indicate no three 

byte match found 

1/2 Return from subroutine 



29 BYTES 

30 CYCLES* 

*First search iteration fails with first byte mismatch, second search iteration successful 



DPTR 



R2, SEARCH 
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PIC16C5X BENCHMARK #4— THREE BYTE TABLE SEARCH 

This benchmark searches a 200 word table (resident in program memory) for a three byte character string, which may be 
resident anywhere in the lookup table (not necessarily on three word boundaries). The status of the carry bit indicates the 
success or failure of the search, with the address of the first byte of a matched string being returned in OFFSET. The benchmark 
is programmed as a subroutine, which in turn calls the table header (TABLE) as a subroutine. This table header should 
immediately precede the 200 word table. 



1/1 Set up table size (200) minus 2 

1/1 Save table size 

1/1 Initialize table offset from table header 

1/1 Save offset 

1/1 Load offset 

1/2 Get 1st byte from table 

1/1 Subtract 1st test byte from CHAR1 

1/1 lest if result is zero (F3, bit 2) 

1/2 Fail if mismatch 

1/1 Increment offset to set up 2nd byte 

1/1 Load offset 

1/2 Get 2nd byte 

1/1 Subtract 2nd test byte from CHAR2 

1/1 Test if result is zero (F3, bit 2) 

1/2 Fail if mismatch 

1/1 Increment offset to set up 3rd byte 

1/1 Load offset 

1/2 Get 3rd byte 

1/1 Subtract 3rd test byte from CHAR3 

1/1 lest if result is zero (F3, bit 2) 

1/2 Fail if mismatch 

1/1 Decrement offset to return to 2nd byte 

1/1 Decrement offset to return to 1st byte 

1/1 Set carry (F3, bit 0) to indicate match 

found 

1/2 Return 

1/1 Decrement offset to set up 2nd byte 

1/1 Decrement offset to set up 1st byte 

1/1 Increment offset to set up new 1st byte 

1/1 Decrement size and skip if table search 

finished 

1/2 Continue table search 

1/1 Clear carry (F3, bit 0) to indicate no 

match found 

1/2 Return 

1/1 Add offset to PC 

1/2 1st entry of data table 

-/- 2nd entry of data table 



TBLSRCH: MOVLW 


198 


MOVWF 


SIZE 


MOVLW 


1 


MOVWF 


OFFSET 


SEARCH : MOVF 


W 


CALL 


TABLE 


SUBWF 


CHAR1 


BTFSS 


STATUS 


GOTO 


FAIL1 


INCF 


OFFSET 


MOVF 


W 


CALL 


TABLE 


SUBWF 


CHAR2 


BTFSS 


STATUS 


GOTO 


FAIL2 


INCF 


OFFSET 


MOVF 


W 


CALL 


TABLE 


SUBWF 


CHAR3 


BTFSS 


STATUS 


GOTO 


FAIL3 


SUCCESS: DECF 


OFFSET 


DECF 


OFFSET 


BSF 


STATUS 



FAIL3: 



RETLW 
DECF OFFSET 
DECF OFFSET 



FAIL1: 


INCF 


OFFSET 




FAIL2 : 


DECFSZ 


SIZE 






GOTO 


SEARCH 






BCF 


STATUS, 


C 



TABLE : ADDWF PC 

RETLW Data 1 
RETLW Data 2 



*First search iteration fails with first byte 34 WORDS (Equivalent to 51 BYTES) 
mismatch, second search iteration successful 52 CYCLES* 



RETLW Program table entry 1 
RETLW Program table entry 2 



RETLW Program table entry 200 



1/2 This 200 word table contains 300 bytes 
1/2 

1/2 
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C0P8 BENCHMARK #5- 


-INPUT/OUTPUT MANIPULATION 




This benchmark compares two 8-bit I/O ports P1 


and P2. If they are equal, a nine is output as the least significant digit (lower 


nibble) of 


a third port P3. 


f port P1 is greater than port P2, then port P2 is output on port P1. If port P1 is less than port P2, then 


the most 


significant digit of Port P1 is copied to the least significant digit of Port P3. 




PORTLD = 


0D0 












PORILC = 


0D1 












PORILI = 


0D2 












PORIGD = 


0D4 












PORIGC = 


0D5 












PORTGI = 


0D6 












PORID = 


ODC 










PORICMP 


: LD 


B, #P0RTLI 






2/3 


Load B pointer (PORIL is PI) 




LD 


X, #P0RTGI 






2/3 


Load X pointer (PORTG is P2) 




SC 








1/1 


Initialize carry (no borrow) for subtraction 




LD 


A, [X] 






1/1 


Load P2 




SUBC 


A, [B] 






1/1 


Subtract PI from P2 




IFC 








1/1 


lest if P2 greater or equal to PI 




JP 


POS 






1/3 


Branch if result positive 


NEG: 


LD 


A, [B-] 






1/2 


Decrement B pointer 




LD 


[B-] , #0FF 






2/2 


Configure PORIL (PI) as output port 




LD 


A, [X] 






1/3 


Load P2 to A 




X 


A, [B] 






1/1 


Output P2 to PI 




JP 


FIN 






1/3 


Jump to finish 


POS: 


IFEQ 


A, #0 






2/2 


Test if result zero 




JP 


EQUAL 






1/3 


Branch if zero 




LD 


A, [B] 






1/1 


Load PI to A 




SWAP 


A 






1/1 


Swap nibbles of A 




X 


A, PORTD 






2/3 


Result to P3 (PORTD) 




JP 


FIN 






1/3 


Jump to finish 


EQUAL : 


LD 


PORID, #09 






3/3 


Digit 9 to P3 (PORTD) 


FIN: 



















26 


BYTES 






PI < P2 


24 


CYCLES 






PI = P2 


21 


CYCLES 






PI > P2 


22 


CYCLES 
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68HC05 BENCHMARK #5— INPUT/OUTPUT MANIPULATION 


This benchmark compares two 8-bit I/O ports P1 and P2. If they are equal, a nine is output as the least significant digit (lower 
nibble) of a third port P3. If Port P1 is greater than port P2, then port P2 is output on Port P1 . If Port P1 is less than Port P2, then 
the most significant digit of Port P1 is copied to the least significant digit of Port P3. 




PORTA 


EQU $00 




Port PI 




PORTB 


EQU $01 




Port P2 




PORTC 


EQU $02 




Port P3 




DDRA 


EQU $04 




PORTA configuration register 




DDRB 


EQU $05 




PORTB configuration register 




DDRC 


EQU $06 




PORTC configuration register 




TEMP 


EQU $9F 




Temporary register 


PORTCMP 


: CLA 




1/3 


Clear A 




STA 


DDRA 


2/4 


Configure Port A as input port (PI) 




STA 


DDRB 


2/4 


Configure Port B as input port (P2) 




DECA 




1/3 


Decrement A to all ones 




STA 


DDRC 


2/4 


Configure Port C as output port (P3) 




LDA 


PORTB 


2/3 


Load A with Port P2 data 




CMP 


A, PORTA 


2/3 


Compare Port PI data with Port P2 data 




BPL 


POS 


2/3 


Branch if comparison result (P2-P1) positive 


NEG: 


LDA 


#$FF 


2/2 


Load A with all ones 




STA 


DDRA 


2/4 


Configure Port A as output port (PI) 




LDA 


PORTB 


2/3 


Port P2 data to A 




STA 


PORTA 


2/4 


Output Port P2 data to Port PI 




BRA 


FIN 


2/3 


Branch to FIN 


POS: 




BEQ EQUAL 


2/3 


Branch if comparison result equal 




LDA 


PORTA 


2/3 


Port PI data to A 




AND 


A, #$F0 


2/2 


Extract high order nibble of Port PI data 




LSRA 




1/3 


Logical shift A right one bit four times 




LSRA 




1/3 


in order to shift upper nibble 




LSRA 




1/3 


down into lower nibble position 




LSRA 




1/3 






STA 


PORTC 


2/4 


Output result to Port P3 




BRA 


FIN 


2/3 


Branch to FIN 


EQUAL : 


LDA 


#9 


2/2 


Load A with digit 9 




STA 


PORTC 


2/4 


Output digit 9 to Port P3 


FIN: 







-/- 








42 BYTES 






PI < P2 53 CYCLES 






PI = P2 36 CYCLES 






PI > P2 42 CYCLES 
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80C51 BENCHMARK #5— INPUT/OUTPUT MANIPULATION 

This benchmark compares two 8-bit I/O ports P1 and P2. If they are equal, a nine is output as the least significant digit (lower 
nibble) of a third port P3. If Port P1 is greater than Port P2, then Port P2 is output on Port P1 . If Port P1 is less than Port P2, then 
the most significant digit of Port P1 is copied to the least significant digit of Port P3. 



PORICMP: 


MOV 




MOV 




CLR 




MOV 




SUBB 




JNC 


NEG: 


MOV 




MOV 




AJMP 


POS: 


JZ 




MOV 




SWAP 




ANL 




MOV 




AJMP 


EQUAL : 


MOV 


FIN: 






PI, #0FF 

P2, #0FF 

C 

A, P2 

A, PI 

POS 

A, P2 

PI, A 

FIN 

EQUAL 

A, PI 

A 

A, #00F 

P3, A 

FIN 

P3, #9 



PI < P2 
PI = P2 
PI > P2 



3/2 
3/2 
1/1 
2/1 
2/1 
2/2 
2/1 
2/1 
2/2 
2/2 
2/1 
1/1 
2/1 
2/1 
2/2 
3/2 
-/- 

33 BYTES 



17 CYCLES 
11 CYCLES 
13 CYCLES 



Configure Port PI for input 

Configure Port P2 for input 

Clear carry 

Load A with Port P2 data 

Subtract Port PI data from A 

Jump TO POS if no carry from subtract 

Load A with Port P2 data 

Output Port P2 data to Port PI 

Jump to FIN 

Jump to EQUAL if subtraction result zero 

Load A with Port PI data 

Swap nibbles of A 

Extract lower nibble of A 

Output Port PI upper nibble to Port P3 

Jump to FIN 

Output digit 9 to Port P3 
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FIN: 



PIC16C5X BENCHMARK #5— INPUT/OUTPUT MANIPULATION 

This benchmark compares two 8-bit I/O ports P1 and P2. If they are equal, a nine is output as the least significant digit (lower 
nibble) of a third Port P3. If Port P1 is greater than Port P2, then Port P2 is output on Port P1 . If Port P1 is less than Port P2, then 
the most significant digit of Port P1 is copied to the least significant digit of Port P3. 



Status register 
Serves as Fort P3 
Serves as Fort F2 
Serves as Port Fl 
Temporary register 

1/1 Load W with all ones 

1/1 Iristate PORIB 

1/1 Tristate PORIC 

1/1 Load W with PORTS input 

1/1 Store PORTB input in TEMP 

1/1 Load W with PORTC input 

1/1 Set carry (bit of STATUS) 

1/1 Subtract PI from P2 

1/1 Clear W 

1/1 Configure PORTA as output port 

1/1 Test if P2 less than PI 

1/2 Branch to POS if test fails 

1/1 Change PORTC to output port 

1/1 Load W with PORTB (P2) input 

1/1 P2 input data output to PORTC (PI) 

1/2 Branch to FIN 

1/1 Test if subtraction result non-zero 

1/2 Branch to EQUAL if test fails 

1/1 Load W with PORTC (PI) input 

1/1 PI input to TEMP 

1/1 Swap nibbles of TEMP with result to W 

1/1 Output result from W to PORTA (P3) 

1/2 Branch to FIN 

1/1 Digit 9 to W 

1/1 Digit 9 output to PORT A (P3) 

-/- 

25 WORDS (Equivalent to 37 1/2 BYTES) 

21 CYCLES 
18 CYCLES 
17 CYCLES 





STATUS 


EQU F3 




PORTA 


EQU F5 




PORTB 


EQU F6 




PORTC 


EQU F7 




TEMP 


EQU F8 


PORT CMP: 


MOVLW 


FFH 




TRIS 


PORTB 




TRIS 


PORTC 




MOVF 


PORTB, 




MOVWF 


TEMP 




MOVF 


PORTC, 




BSF 


STATUS 




SUBWF 


TEMP 




CLRW 






TRIS 


PORTA 




BTFSC 


STATUS 




GOTO 


POS 


NEG: 


TRIS 


PORTC 




MOVF 


PORTB, 




MOVWF 


PORTC 




GOTO 


FIN 


POS: 


BTFSC 


STATUS 




GOTO 


EQUAL 




MOVF 


PORTC, 




MOVWF 


TEMP 




SWAPF 


TEMP, 




MOVWF 


PORTA 




GOTO 


FIN 


EQUAL : 


MOVLW 


9 




MOVWF 


PORTA 



PI < P2 
PI = P2 
PI > P2 



http://www.national.com 



24 



C0P8 BENCHMARK #6— SERIAL INPUT/OUTPUT WITH OFFSET TABLE 


This benchmark inputs a sequence of byte couplets, with each couplet consisting of an 8-bit offset address followed by a data 
byte. The address is used to access a program memory data table to produce an offset value which is added to the second byte 
of the input couplet. The updated data byte is output while the address byte of the next couplet is being input. The benchmark is 
written as a subroutine, with the size of the input stream being an input parameter. 


The program is written using the MICROWIRE/PLUS fast burst technique, which utilizes a 500 kHz burst clock SK (instruction 
cycle clock divided by 2). Each byte of the I/O stream utilizes nine burst clock periods, yielding an effective byte rate of 55.6 
kHz. 


*** Setting the Microwire busy bit to initialize 
program to work. Alternate cycle definitions 


the Microwire must occur at the same time in each alternating 1 8 cycle loop for this 
are as follows: 




Even cycle : 


New address input for table lookup offset value 
Previous data result output 




Odd cycle : 


New dat 


a input added to offset value from table lookup 






New dat 


a input returned as output 




PORTGD = 0D4 






P0RTG configuration register 




PORTGC = 0D5 






PORTG data register 




SIOR = 0E9 






Serial Input/Output register 




CNTRL = OEE 






Control register 




PSW = OEF 






Program Status Word 




SIZE = 0F0 






Size of data stream 




BUSY = 2 






Microwire busy bit in PSW register 


UPDATE : 


LD PORTGC, #030 




3/3 


Configure G4, G5 as outputs SO, SK to 




LD PORTGD, #0 




3/3 


select uwire master mode (SI is G6) 




INC A 




1/1 


Increment byte count to compensate for one 




X A, SIZE 




2/3 


byte throughput and store result in SIZE 




LD B, #CNTRL 




2/3 


Set up B pointer for CNTRL register 




LD [B+] , #8 




2/2 


Select uwire master with divide by 2 clock 




RC 




1/1 


Initialize carry 


LOOP: 


IFNC 




1/1 


1 1 Test if no carry 




JP BYP1 




1/3 


1 3 Branch if no carry 




NOP 




1/1 


1 - NOP for delay compensation for 18 cycle loop 




ADD A, SIOR 




3/4 


4 - Add uwire input data to table offset value 




SBIT BUSY, [B] 




1/1 


1 - ***Set uwire BUSY bit to start microwire 




RC 




1/1 


1 - Reset carry 




JP BYP2 




1/3 


3 - Branch 


BYP1: 


X A, SIOR 




2/3 


- 3 New address input & previous result output 




SBIT BUSY, [B] 




1/1 


- 1 ***Set uwire BUSY bit to start microwire 




LAID 




1/3 


- 3 Offset table lookup from program memory 




sc 




1/1 


- 1 Set carry 


BYP2: 


DRSZ SIZE 




1/3 


3 3 Decrement SIZE and test if result zero 




JP LOOP 




1/3 


3 3 Loop back if zero test fails 




RET 




1/5 


Return from subroutine 






31 BYTES 






18 CYCLES per Loop 
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M68HC05 BENCHMARK #6— SERIAL INPUT/OUTPUT WITH OFFSET TABLE 

This benchmark inputs a sequence of byte couplets, with each couplet consisting of an 8-bit offset address followed by a data 

byte. The address is used to access a data table to produce an offset value which is added to the second byte of the input 

couplet. The updated data byte is output while the address byte of the next couplet is being input. The benchmark is written as a 

subroutine, with the size of the input stream being an input parameter. 

The program is written using the SPI with the divide by 2 fast clock option selected, which yields a 1 MHz burst clock SCK 

(instruction cycle clock divided by 2). Each byte of the I/O stream utilizes 27 burst clock periods (equivalent to 13.5 p.s), yielding 

an effective byte rate of 74.1 kHz. 

** 'Setting the SPI system enable bit SPE to initialize the SPI must occur at the same time in each alternating 27 cycle loop for 

this program to work. Alternate cycle definitions are as follows: 



Even cycle : 
Odd cycle : 



PORTD EQU $03 

SPCR EQU $0A 

SPSR EQU $0B 

SPDR EQU $0C 



UPDATE : 


INCA 






STA 


A, SIZE 




LDA 


#$040 




STA 


SPCR 




CLC 




LOOP: 


BCC 


BYP1 




ADD 


SPDR 




STA 


SPDR 




BSET 


6, SPCR 




CLC 






BRA 


BYP2 


BYP1: 


LDA 


X 




LDX 


SPDR 




BSET 


6, SPCR 




LDA 


X 




SEC 




BYP2: 


DEC 


SIZE 




BNE 


LOOP 




RTS 





New address input for table lookup offset value 
Previous data result output 

New data input added to offset value from table lookup 
New data input returned as output 

SPI Interface port 
Serial Peripheral Control Register 
Serial Peripheral Status Register 
Serial Peripheral Data I/O Register 

1/3 Increment byte count to compensate for one 

2/4 byte throughput and store result in SIZE 

2/2 Set up data for SPCR register 

2/4 Enable SPI master with divide by 2 clock 

1/2 Initialize carry 

2/3 2 3 Test and branch if no carry 

2/3 3 - Add SPI input data to table offset value in A 

2/4 4 - Output result to SPDR for SPI output 

2/5 5 - ***Turn on SPI system enable bit 

1/2 2 - Clear carry 

2/3 3 - Branch 

1/3 - 3 NOP for delay compensation for 27 cycle loop 

2/3 - 3 SPI input data address to index register 

2/5 - 5 ***Turn on SPI system enable bit 

1/3 - 3 Load A with table lookup offset value 

1/2 - 2 Set carry 

2/5 5 5 Decrement SIZE 

2/3 3 3 Test and branch if result equal to zero 

1/6 Return from subroutine 

31 BYTES 

27 CYCLES per Loop 
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80C51 BENCHMARK #6— SERIAL INPUT/OUTPUT WITH OFFSET TABLE 

This benchmark inputs a sequence of byte couplets, with each couplet consisting of an 8-bit offset address followed by a data 

byte. The address is used to access a program memory data table to produce an offset value which is added to the second byte 

of the input couplet. The updated data byte is output while the address byte of the next couplet is being input. The benchmark is 

written as a subroutine, with the size of the input stream being an input parameter. 

The program is written using the 8-bit Shift Register Mode 0, with the serial data being either transmitted or received (not both 

simultaneously) with the least significant bit first. The clock rate for this mode is 1/12 that of the on-chip oscillator frequency of 

12 MHz, which equates to an I/O baud rate of 1 MHz. This is equivalent to the instruction cycle time of 1 lis. In Shift Register 

Mode 0, the RXD line is used for both data input and output, while the TXD line is used for the output clock. Consequently, the 

terms "RXD" and "TXD" are misleading in this mode of serial I/O. 

The program takes 52 instruction cycles per couplet loop, which equates to 26 instruction cycles (26 /as) per byte. This yields an 

effective byte rate of 38.5 kHz. 



SIZE 



EQU R2 



UPDATE : 


MOV 


SIZE, A 


1/1 




INC 


SIZE 


1/1 




MOV 


DPTR, #TBLBASE 


3/2 


LOOP: 


CLR 


C 


1/1 


SINP: 


MOV 


SCON, #010H 


3/2 


STALL1: 


JNB 


SCON.RI, SIALL1 


3/2 




JC 


BYP 


2/2 




MOV 


A, SBUF 


2/1 




MOVC 


A, @A + DPTR 


1/2 




SETB 


C 


1/1 




AJMP 


SINP 


2/2 


BYP: 


ADD 


A, SBUF 


2/1 




MOV 


SCON, #0 


3/2 


SOUI: 


MOV 


SBUF, A 


2/1 


SIALL2: 


JNB 


SCON. II, SIALL2 


3/2 




CLR 


SCON.TI 


2/1 




DJNZ 


SIZE, LOOP 


2/2 



1/2 



Save and increment byte count to 

compensate for one byte throughput 
Set up table starting address 
Clear carry 

Initialize SCON for Mode and enable receive 
Wait for receiving to finish 
Jump if carry 

Read SBUF to get data table address 
Offset value from program memory 
data table lookup 
Set carry 
Jump to SINP 

Add input data from SBUF to offset value 
Reinitialize SCON for Mode and no receive 
Result to SBUF starts serial output 
Wait for transmitting to finish 
Clear TI (transmit interrupt) flag 
Decrement & test SIZE and loop back 
if not finished 
Return from subroutine 



35 BYTES 

52 CYCLES per couplet Loop 
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PIC16C5X BENCHMARK #6— SERIAL INPUT/OUTPUT WITH OFFSET TABLE 

This benchmark inputs a sequence of byte couplets, with each couplet consisting of an 8-bit offset address followed by a data 
byte. The address is used to access a program memory data table to produce an offset value which is added to the second byte 
of the input couplet. The updated data byte is output while the address byte of the next couplet is being input. The benchmark is 
written as a subroutine, with the size of the input stream being an input parameter. 

The program is written using a transmit/receive subroutine and takes 214 instruction cycles per couplet loop. This equates to 
107 instruction cycles (53.5 p.s) per byte, yielding an effective byte rate of 18.7 kHz. 



1/1 Save and increment byte couplet count to 

1/1 compensate for one byte throughput 

1/1 Clear data register 

1/1 Set up bit select for input 

1/1 Iristate bit 2 of PORIB 

1/2 Call XMITREC transmit/receive subroutine 

1/1 Move offset table address to W 

1/2 Call TABLE (table lookup subroutine) 

1/2 Call XMITREC subroutine 

1/1 Add offset to received data 

1/1 Decrement SIZE and test if zero 

1/2 Branch back if not finished 

1/2 Return from subroutine 

1/1 Set up bit count 

1/1 Bit count to BITCNT 

1/1 Reset clock output bit 

1/1 Reset data output bit 

1/1 Rotate right through carry 

1/1 Test carry bit and skip if clear 

1/1 Set data output bit 

1/1 Set clock output bit 

1/1 Clear upper bit of XRDATA 

1/1 Test receive bit 

1/1 Set upper bit of XRDATA if receive bit high 

1/1 Decrement count and skip if zero 

1/2 Branch back to XMITREC 

1/1 Reset clock output bit 

1/2 Return from subroutine 

1/1 Add offset to PC 

1/2 1st entry of data table 

-/- 2nd entry of data table 

30 WORDS (Equivalent to 45 BYTES) 
214 CYCLES per couplet Loop 



UPDATE : 


MOVWF 


SIZE 




INC? 


SIZE 




CLRF 


XRDATA 




MOVLW 


2 




TRIS 


PORTB 


LOOP: 


CALL 


XMITREC 




MOVE 


XRDATA, W 




CALL 


TABLE 




CALL 


XMITREC 




ADDWF 


XRDATA 




DECFSZ 


SIZE 




GOTO 


LOOP 




RETLW 





XMITREC: 


MOVLW 


8 




MOVWF 


BITCNT 


LUP: 


BCF 


PORTB, 




BCF 


PORTB, 1 




RRF 


XRDATA 




BTFSC 


F3, 




BSF 


PORTB, 




BSF 


PORTB, 1 




BCF 


XRDATA, 7 




BTFSC 


PORTB, 2 




BSF 


XRDATA, 7 




DECFSZ 


BITCNT 




GOTO 


LUP 




BCF 


PORTB, 1 




RETLW 





TABLE : 


ADDWF 


PC 




RETLW 


Data 1 




RETLW 


Data 2 
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C0P8 BENCHMARK #7— TIMEKEEPING 

This timekeeping benchmark is interrupt driven, using 5 ms. timer cycle interrupts. The program emulates a real time clock, 
keeping track of hours, minutes, and seconds in packed BCD format. 



5 ms low component for timer 

5 ms high component for timer 

Timer low byte 

Timer high byte 

Autoreload register low byte 

Autoreload register high byte 

CNIRL control register 

PSW control register 

5 ms counter 

Hour register (packed BCD format) 

Minute register (packed BCD format) 

Second register (packed BCD format) 

Temporary register 



TIMER SETUP ROUTINE: 




LO = 088 




HI = 013 




TMRLO = OEA 




TMRHI = OEB 




TAULO = OEC 




TAUHI = OED 




CNTRL = OEE 




PSW = OEF 




CNTR = 0E0 




HOUR = 0E1 




MIN = 0F2 




SEC = 0F3 




TEMP = 0F4 




TSETUP: LD 


B, #TAULO 


LD 


[B+], #LO 


LD 


[B+], #HI 


LD 


[B+] , #080 


LD 


[B+], #011 


LD 


[B+] , #200 


LD 


[B+], #1 


LD 


[B+], #0 


LD 


[B], #0 


SBIT 


4, CNTRL 


STALL: JP 


STALL 



2/3 
2/3 
2/2 
2/2 
2/2 
2/2 
2/2 
2/2 
2/2 
3/4 
-/- 

21 BYTES 



Load B pointer with address of low autoreload 

Load low autoreload reg with 5 ms component 

Load high autoreload reg with 5 ms component 

Set up timer PWM mode in CNTRL register 

Set up timer interrupt in PSW register 

Initialize 5 ms counter to count one second 

Initialize HOUR to 1 

Initialize MIN to 

Initialize SEC to 

Start timer 

Loop back on self to simulate main program 
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COP8 BENCHMARK #7- 


TIMEKEEPING (Conti 


nued) 




TIMER INTERRUPT SERVICE ROUTINE: 






TIMEKEEP: 


DRSZ 


CNTR 




1/3 


Decrement 5 ms counter 




RETI 






1/5 


Return from timer interrupt 




LD 


CNTR, #200 




2/3 


Reload 5 ms counter 




LD 


B, #TEMP 




2/3 


Load B pointer with address of TEMP 




X 


A, [B-] 




1/2 


Save A in TEMP 




JSR 


INCBCD 




2/5 


Call INCBCD subroutine to increment SEC 




RETI 






1/5 


Return from timer interrupt 




JSR 


INCBCD 




2/5 


Call INCBCD subroutine to increment MIN 




RETI 






1/5 


Return from timer interrupt 




SC 






1/1 


Set carry for BCD increment of HOUR 




LD 


A, #066 




2/2 


Set up BCD increment 




ADC 


A, [B] 




1/1 


Increment A in BCD 




DCOR 






1/1 


Decimal correct result of BCD increment 




X 


A, [B] 




1/1 


Store result in HOUR 




IEEQ 


A, #012 




2/2 


Test if result was BCD 13 (previous result 12) 




LD 


[B], #1 




2/2 


Reset HOUR from BCD 13 to BCD 1 




LD 


A, TEMP 




2/3 


Restore A from TEMP 




RETI 






1/5 


Return from timer interrupt 


INCBCD: 


SC 






1/1 


Set carry for BCD increment 




LD 


A, #066 




2/2 


Set up BCD increment 




ADC 


A, [B] 




1/1 


Increment A in BCD 




DCOR 






1/1 


Decimal correct result of BCD increment 




X 


A, [B] 




1/1 


Store result 




IEEQ 


A, #059 




2/2 


Test if result was BCD 60 (previous result 59) 




JP 


RESTORE 




1/3 


Jump to RESTORE if test successful 




RET 






1/5 


Return from subroutine 


RESTORE: 


LD 


[B-], #0 




2/3 


Reset to zero 




RETSK 






1/5 


Return from subroutine and skip 








39 BYTES 






TOTAL: 


60 BYTES 
80 CYCLES* 


*Case whe 


re 12:59:59 


is advancing to 


1:00:00 
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M68HC05 BENCHMARK #7— TIMEKEEPING 

This timekeeping benchmark is interrupt driven, using the timer output compare interrupt (in conjunction with the 131.072 ms 
timer cycle). The program emulates a real time clock, keeping track of hours, minutes, and seconds in packed BCD format. 
The timer clock is equivalent to the instruction cycle clock (2 MHz) divided by 4, yielding a timer clock period of 2 jus. The 16-bit 
free running timer has a maximum of 65536 counts, which at 2 ]as per count equates to a timer cycle of 1 31 .072 ms. One second 
divided by the timer cycle (1,000,000 divided by 131,072) yields 7 cycles with a residual of 82.496 ms. This residual equates to 
41248 timer counts, which represents the value (A120 in hex) added to the output compare register during each timer interrupt 
service. 



TIMER SETUP ROUTINE: 





0CIE 


EQU 6 




CNTR 


EQU 10 




TEMP 


EQU 11 




ATEMP 


EQU 12 




SEC 


EQU 20 




MIN 


EQU 21 




HOUR 


EQU 22 


TSETUP: 


CLR 


SEC 




CLR 


MIN 




CLR 


HOUR 




INC 


HOUR 




LDA 


#7 




STA 


CNTR 




BSET 


0CIE, 


STALL: 


JMP 


STALL 



Output compare interrupt bit in TCR (timer control reg) 



TCR 



2/5 
2/5 
2/5 
2/5 
2/2 
2/4 
2/5 
-/- 

14 BYTES 



Initialize SEC to 

Initialize MIN to 

Clear HOUR to 1 

Initialize HOUR to 1 

Load counter initialization value 

Initialize timer loop counter 

Enable Output Compare Interrupt 

Loop back on self to simulate main program 
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M68HC05 BENCHMARK #7— TIMEKEEPING (Continued) 

TIMER INTERRUPT SERVICE ROUTINE: ; OCMP = OUTPUT COMPARE 



TIMEKEEP: 


DEC 


CNTR 


2/5 




BNE 


FINI 


2/3 




STA 


TEMP 


2/4 




LDA 


#7 


2/2 




STA 


CNTR 


2/4 




LDA 


OCMPLO 


2/3 




ADD 


#$20 


2/2 




STA 


ATEMP 


2/4 




LDA 


OCMPHI 


2/3 




ADC 


#$A1 


2/2 




STA 


OCMPHI 


2/4 




LDA 


TSR 


2/3 




LDA 


ATEMP 


2/3 




STA 


OCMPLO 


2/4 




LDX 


#20 


2/2 




JSR 


INCBCD 


2/5 




BCC 


FIN 


2/3 




JSR 


INCBCD 


2/5 




BCC 


FIN 


2/3 




JSR 


BCDFIX 


2/5 




CMP 


A, #$013 


2/2 




BNE 


FIN 


2/3 




CLR 


X 


1/5 




INC 


X 


1/5 


FIN: 


LDA 


TEMP 


2/3 


FINI: 


RTI 




1/9 


INCBCD: 


JSR 


BCDFIX 


2/5 




CMP 


A, #$060 


2/2 




BNE 


BYP 


2/3 




CLR 


X 


1/5 




SEC 




1/2 




INCX 




1/3 


BYP: 


RTS 




1/5 


BCDFIX: 


LDA 


X 


1/3 




ADD 


#1 


2/2 




STA 


X 


1/4 




AND 


A, #$0F 


2/2 




CMP 


A, #10 


2/2 




BNE 


BYPASS 


2/3 




LDA 


#$010 


2/2 




STA 


X 


1/4 


BYPASS : 


RTS 




1/5 



Decrement 5 ms counter 

Branch if result zero 

Save A in TEMP 

Load A with counter reload value 

Reload timer loop counter 

Read OCMPLO 

ADD low byte of offset count 

Store in ATEMP until OCMPHI is updated 

Read OCMPHI 

Add high byte of offset count 

Update OCMPHI with new offset 

Read TSR to clear OCF flag 

Retrieve update for OCMPLO from ATEMP 

Update OCMPLO with new offset 

Load index register with address of SEC 

Call INCBCD subroutine 

Branch if carry clear 

Call INCBCD subroutine 

Branch if carry clear 

Call BCDFIX subroutine 

Compare A with BCD 13 

Branch if not equal 

Clear result 

Set HOUR to 1 

Restore A from TEMP 

Return from interrupt 

Call BCDFIX subroutine 

Compare A with BCD 60 

Branch if not equal 

Clear result (SEC or MIN) to zero 

Set carry 

Increment index 

Return from subroutine 

Load A with operand (SEC or MIN) 

Add 1 to A 

Store A in result 

Extract low order BCD digit 

Compare result with 10 

Branch if not equal 

Load A with BCD 10 

Store BCD 10 in result 

Return from subroutine 



*Case where 12:59:59 is advancing to 1:00:00 73 BYTES 



TOTAL: 



87 BYTES 
218 CYCLES* 
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80C51 BENCHMARK #7— TIMEKEEPING 

This timekeeping benchmark is interrupt driven, using 250 jas timer cycle interrupts. The program emulates a real time clock, 
keeping track of hours, minutes, and seconds in packed BCD format. 

TIMER SETUP ROUTINE: 



TSETUP: 



LOOP: 



CNTR1 


EQU R2 




CNTR2 


EQU R3 




TEMP 


EQU R4 




SEC 


EQU 20 




MIN 


EQU 21 




HOUR 


EQU 22 




MOV 


CNTR1, #20 


2/1 


MOV 


CNTR2, #200 


2/1 


MOV 


SEC, #0 


2/1 


MOV 


MIN, #0 


2/1 


MOV 


HOUR, #1 


2/1 


MOV 


TMOD, #02H 


3/2 


MOV 


THO, #-250 


3/2 


SETB 


TRO 


2/1 


MOV 


IE, #82H 


3/2 


SJMP 


LOOP 

s 


-/- 

1 BYTES 



Initialize 250 p,s counter to 20 

Initialize 5 ms counter to 200 

Initialize SEC to 

Initialize MIN to 

Initialize HOUR to 1 

Select Timer 0, Mode 2 

Setup 250 fis delay for Timer 

Start Timer 

Enable Timer interrupt 

Loop back on self to simulate main program 



33 



http://www.national.com 



80C51 BENCHMARK #7- 


■TIMEKEEPING (Continued) 






TIMER INTERRUPT SERVICE ROUTINE: 








TIMEKEEP: 


DJNE 


CNTR1, FIN 




2/2 


Decrement 250 |us counter and jump if not zero 




MOV 


CNTR1, #20 




2/1 


Reload 250 p,s counter 




DJNE 


CNTR2, FIN 




2/2 


Decrement 5 ms counter and jump if not zero 




MOV 


CNTR2, #200 




2/1 


Reload 5 ms counter 




MOV 


TEMP, A 




1/1 


Save A in TEMP 




MOV 


RO, #20 




2/1 


Load RO with address of SEC 




ACALL 


INCBCD 




2/2 


Call INCBCD subroutine 




JNC 


FIN 




2/2 


Jump if no carry 




ACALL 


INCBCD 




2/2 


Call INCBCD subroutine 




JNC 


FIN 




2/2 


Jump if no carry 




MOV 


A, §R0 




1/1 


Move HOUR to A 




CLR 


C 




1/1 


Initialize carry 




ADD 


A, #1 




2/1 


Add 1 to A 




DA 


A 




1/1 


Decimal adjust A for BCD correction 




MOV 


@R0, A 




1/1 


Return A to HOUR 




CJNE 


A, #013H, FIN 




3/2 


Compare and jump if hour not equal to 13 




MOV 


@R0, #1 




2/1 


Correct HOUR to 1 


FIN: 


MOV 


A, TEMP 




1/1 


Restore A from TEMP 




REII 






1/2 


Return from timer interrupt 


INCBCD: 


MOV 


A, §R0 




1/1 


Move operand to A (SEC or MIN) 




ADD 


A, #1 




2/1 


Add 1 to A 




DA 


A 




1/1 


Decimal adjust A for BCD correction 




MOV 


@R0, A 




1/1 


Return A to result register (SEC or MIN) 




CJNE 


A, #060H, BYPASS 




3/2 


Compare and jump if result not equal to 60 




MOV 


§R0, #0 




2/1 


Correct result to zero (SEC or MIN) 




SETB 


C 




1/1 


Set carry to indicate corrected result 


BYPASS : 


INC 


RO 




1/1 


Increment indirect address data pointer 




RET 






1/2 


Return from subroutine 






4 


5 


BYTES 






TOTAL: 6 

4 


6 
9 


BYTES 
CYCLES* 


*Case whe 


re 12:59:59 


is advancing to 1:C 





:00 
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PIC16C5X BENCHMARK #7— TIMEKEEPING 

This timekeeping benchmark uses a pseudo software 5 ms delay loop as the basic core timing for emulating a real time clock. 
Note that the PIC16C5X does not have any hardware interrupts. Consequently, the program must keep track of when the 
counter RTCC overflows for real time applications. The RTTC F1 file register is only 8 bits, but with an 8-bit prescaler selected 
(1 : 256 RTTC rate), the RTTC emulates a 1 6-bit timer. The program keeps track of hours, minutes, and seconds in packed BCD 
format. 

5 ms timing analysis: 0.5 jus (instruction cycle time) X 256 (prescaler) = 128 jus RTTC increment rate 
5 ms emulation: 5000/128 = 39 cycles + residue of 8 jus 



Register File Fl 
Reg File F3 
Reg File F20 
Reg File F21 
Reg File F22 
Reg File F23 
Reg File F24 





RTCC 


EQU 1 




STATUS 


EQU 3 




TEMP 


EQU 20 




SEC 


EQU 21 




MIN 


EQU 22 




HOUR 


EQU 23 




CNTR 


EQU 24 


TIMEKEEP: 


MOVLW 
OPTION 


7 




CLRF 


SEC 




CLRF 


MIN 




CLRF 


HOUR 




INCF 


HOUR 




MOVLW 


217 




MOVWF 


RTCC 


STALL: 


GOTO 


STALL 



1/1 
1/1 
1/1 
1/1 
1/1 
1/1 
1/1 
1/1 
-/- 

8 WORDS 



Set up data for option register 

Select a 1:256 prescaler with RTTC 

Initialize SEC to 

Initialize MIN to 

Clear HOUR 

Initialize HOUR TO 1 

256-39 = 217 (RTCC counter increments) 

Set up RTCC for 39 counts until overflow 

Loop back on self to simulate main program 



Main program must return within 5 ms to sense exactly when RTCC counter overflows 
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PIC16C5X BENCHMARK #7— TIMEKEEPING (Continued) 



TSTOVFLW: BTFSC 


RTCC, 7 


1/1 


GOTO 


TSTOVFLW 


1/2 


MOVLW 


217 


1/1 


MOVWF 


RTCC 


1/1 


DECFSZ 


CNTR 


1/1 


GOTO 


STALL 


1/2 


MOVLW 


200 


1/1 


MOVWF 


CNTR 


1/1 


MOVLW 


SEC 


1/1 


MOVWF 


F4 


1/1 


CALL 


INCBCD 


1/2 


BTFSS 


F3, 2 


1/1 


GOTO 


STALL 


1/2 


CALL 


INCBCD 


1/2 


BTFSS 


F3, 2 


1/1 


GOTO 


STALL 


1/2 


INCF 


HOUR 


1/1 


MOVLW 


10 


1/1 


SUBWF 


HOUR, W 


1/1 


MOVLW 


6 


1/1 


BTFSC 


F3, 2 


1/1 


ADDWF 


HOUR 


1/1 


MOVLW 


013H 


1/1 


SUBWF 


HOUR, W 


1/1 


MOVLW 


1 


1/1 


BTFSC 


F3, 2 


1/1 


MOVWF 


HOUR 


1/1 


GOTO 


STALL 


1/2 


INCBCD: INCF 


FO 


1/1 


MOVF 


FO, W 


1/1 


MOVWF 


TEMP 


1/1 


MOVLW 


OFH 


1/1 


ANDWF 


TEMP, W 


1/1 


MOVLW 


10 


1/1 


BSF 


F3, 


1/1 


SUBWF 


TEMP, W 


1/1 


MOVLW 


6 


1/1 


BTFSC 


F3, 2 


1/1 


ADDWF 


FO 


1/1 


MOVLW 


060H 


1/1 


BSF 


F3, 


1/1 


SUBWF 


FO, W 


1/1 


BTFSC 


F3, 2 


1/1 


CLRF 


FO 


1/1 


INCF 


F4 


1/1 


RETLW 





1/2 



Test if RTCC has overflowed 

Loop back if test fails 

256-39 = 217 

Set up RTCC for 39 counts until overflow 

Decrement CNTR and test for zero result 

Go to MAIN program 

Set up 200 (200 X 5 ms = 1 sec) 

Initialize 5 ms counter 

Address of SEC to W 

Address of SEC to F4 

Call INCBCD subroutine to increment SEC 

Test and skip if zero result (Z is F3 bit 2) 

Go to MAIN program 

Call INCBCD subroutine to increment MIN 

Test and skip if zero result 

Go to MAIN program 

Increment hour 

Move 10 to W 

HOUR - 10 to W 

Move 6 to W 

Test if subtraction result not zero 

Add 6 to HOUR for BCD correction 

Move BCD 13 to W 

HOUR - BCD 13 to W 

Move 1 to W 

Test and skip if subtraction result not zero 

Move 1 to hour if subtraction result zero 

Go to MAIN program 

Increment BCD data operand 

Move data to W 

Move data to TEMP 

Move hex OF to W 

Extract low order BCD digit 

Move 10 to W 

Set carry (reset borrow for subtraction) 

TEMP - 10 to TEMP 

Move 6 to W 

Test if subtraction result zero (Z is F3 bit 2) 

Add 6 to correct BCD operand if test successful 

Move BCD 60 to W 

Set carry (reset borrow for subtraction) 

BCD operand - BCD 60 to W 

Test if subtraction result zero 

Reset BCD operand to zero if test successful 

Increment indirect address pointer 

Return from subroutine 



*Case where 12:59:59 is advancing to 1:00:00 46 WORDS 



TOTAL: 



54 WORDS (Equivalent to 81 BYTES) 
69 CYCLES* 
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C0P8 BENCHMARK #8— SWITCH ACTIVATED FIVE SECOND LED 


This benchmark samples a switch input to activate a five second output for turning on an LED. The switch is debounced with a 
50 ms delay both on opening and closure. Once activated, the switch will turn on an LED output for five seconds and then turn it 
off, regardless of whether or not the switch is still activated. Once the switch is turned off, the procedure is repeated. Both the 
switch input and the LED output are low true. 




CNTR1 : 


= 0F0 




Three Counters in registers 0F0, 0F1, 0F2 




CNTR2 : 


= 0F1 








CNTR3 : 


= 0F2 








PORILD 


= 0D1 




PORTL Data register 




PORTLC 


= 0D2 




PORTL Configuration register 




PORTLI 


= 0D3 




PORTL Input address 


LED5: 


LD 


B, #P0RILD 


2/3 


PORTL data register address to B pointer 




LD 


[B+] , #0F0 


2/2 


Set upper 4 bits of data register for low true output 




LD 


[B+] , #0F0 


2/2 


Configure L port for upper nibble output and 
lower nibble input (inor B ptr to port input addr) 


WSWON : 


IFBIT 


0, [B] 


1/1 


Sample input switch (low true) 




JP 


WSWON 


1/3 


Wait for Switch ON 


SWON: 


JSR 


DLY50 


2/5 


Debounce 50 ms 




RBIT 


7, PORTLD 


3/4 


Turn on LED (low true) 




LD 


CNTR1, #100 


2/3 


Call DLY50 50 ms subroutine 100 times 


SEC5: 


JSR 


DLY50 


2/5 


to get 5 second LED ON time 




DRSZ 


CNTR1 


1/3 


Decrement CNTR1 and test for zero result 




JP 


SEC5 


1/3 


Loop back until count finished 




SBII 


7, PORTLD 


3/4 


Turn off LED (low true) 


WSWOFF 


: IFBIT 


0, [B] 


1/1 


Sample input switch (low true) 




JP 


SWOFF 


1/3 


Jump to Switch OFF 




JP 


WSWOFF 


1/3 


Wait for Switch OFF 


SOFF: 


JSR 


DLY50 


2/5 


Debounce 50 ms 




JP 


WSWON 


1/3 


Repeat procedure (wait for Switch ON) 


DLY50 : 


LD 


CNTR2, #33 


2/3 


Set up outer loop count 




LD 


CNTR3, #118 


2/3 


Set up initial inner loop count 


LOOP: 


DRSZ 


CNTR3 


1/3 


Decrement inner loop count and test for zero 




JP 


LOOP 


1/3 


Loop back until inner count finished 




DRSZ 


CNTR2 


1/3 


Decrement outer loop count and test for zero 




JP 


LOOP 


1/3 


Loop back until outer count finished 




RET 




1/5 


Return from subroutine 


"Cycle 


times without wait loopsC 


7 BYTES 






76 CYCLES* 


DLY50 


TIMING ANALYSIS: C 

Initial (2X3) 6 
1 x (117 x 6 + 1 x 10) r 


YCLES 
12 






32 x (255 x 6 + 1 x 10$ 


9280 






Terminating (5-2) C 








TOTAL 


50001 Equivalent to 50.001 ms (8 1 (is per cycle 
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M68HC05 BENCHMARK #8— SWITCH ACTIVATED FIVE SECOND LED 


This benchmark samples a switch input to activate a five second output for turning on an LED. The switch is debounced with a 
50 ms delay both on opening and closure. Once activated, the switch will turn on an LED output for five seconds and then turn it 
off, regardless of whether or not the switch is still activated. Once the switch is turned off, the procedure is repeated. Both the 
switch input and the LED output are low true. 




PORTB 


EQU $01 








DDRB 


EQU $05 








TEMP 


EQU $9F 






LED5: 


LDA 


#$F0 


2/2 


Output data and configuration to A 




STA 


PORTB 


2/4 


Set upper 4 bits of PORTB for low true output 




STA 


DDRB 


2/4 


Configure PORTB for upper nibble output 
and lower nibble input 


WSWON: 


BRSEI 


0, PORTB, WSWON 


3/5 


Sample input switch (low true) 


SWON: 


JSR 


DLY50 


3/6 


Debounce 50 ms 




BCLR 


7, PORTB 


2/5 


Turn on LED (low true) 




LDA 


#100 


2/2 


Call DLY50 50 ms subroutine 100 times 


SEC5: 


JSR 


DLY50 


3/6 


to get 5 second LED ON time 




DECA 




1/3 


Decrement count 




BNE 


SEC5 


2/3 


Loop back until count finished 




BSET 


7, PORTB 


2/5 


Turn off LED (low true) 


WSWOFF 


: BRCLR 


0, PORTB, WSWOFF 


3/5 


Sample input switch (low true) 


SWOFF : 


JSR 


DLY50 


3/6 


Debounce 50 ms 




BRA 


WSWON 


2/3 


Repeat procedure (wait for Switch ON) 


DLY50 : 


STA 


TEMP 


2/4 


Save A 




LDA 


#65 


2/2 


Set up outer loop count 




LDX 


#226 


2/2 


Set up inner loop count 


LOOP: 


DECX 




1/3 


Decrement inner loop count 




BNE 


LOOP 


2/3 


Loop back if non-zero 




DECA 




1/3 


Decrement outer loop count 




BNE 


LOOP 


2/3 


Loop back if non-zero 




LDA 


TEMP 


2/3 


Restore A 




RTS 




1/6 


Return from subroutine 






47 BYTES 






88 CYCLES* 


"Cycle 


times without wait loops 






DLY50 


TIMING ANALYSIS 
Initial (4 


C 

+ 2 + 2) E 


YCLES 






1 x (225 x 


6 + 1 x 11) 1 


361 






64 x (255 x 6 + 1 x 11) £ 


8624 






Terminating (3+6-1) £ 








TOTAL 


1 


00001 


Equivalent to 50.000 ms @ 0.5 fis per cycle 
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80C51 BENCHMARK #8— SWITCH ACTIVATED FIVE SECOND LED 

This benchmark samples a switch input to activate a five second output for turning on an LED. The switch is debounced with a 
50 ms delay both on opening and closure. Once activated, the switch will turn on an LED output for five seconds and then turn it 
off, regardless of whether or not the switch is still activated. Once the switch is turned off, the procedure is repeated. Both the 
switch input and the LED output are low true. 



Configure lower 4 bits of port PI for input 

Sample input switch (low true) 

Debounce 50 ms 

Turn on LED (low true) 

Call DLY50 50 ms subroutine 100 times 

to get 5 second LED ON time 

Decrement count and test and test for zero result 

Turn off LED (low true) 

Sample input switch (low true) 

Debounce 50 ms 

Repeat procedure (wait for Switch ON) 



2/1 Set up outer loop count 

2/1 Set up inner loop count 

2/2 Decrement inner loop count and test for zero 

2/2 Decrement outer loop count and test for zero 

1/2 Return from subroutine 

34 BYTES 
27 CYCLES* 



LED5: 


MOV 


PI, #00F 


3/2 


WSWON : 


JB 


P1.0, WSWON 


3/2 


SWON: 


ACALL 


DLY50 


2/2 




CLR 


PI. 7 


2/1 




MOV 


R2, #100 


2/1 


SEC5: 


ACALL 


DLY50 


2/2 




DJNZ 


R2, SEC5 


2/2 




SETB 


PI. 7 


2/1 


WSWOFF : 


JNB 


P1.0, WSWOFF 


3/2 


SWOFF : 


ACALL 


DLY50 


2/2 




AJMP 


WSWON 


2/2 



DLY50 : 



LOOP: 



MOV R3, #98 

MOV R4, #68 

DJNZ R4, LOOP 

DJNZ R3, LOOP 

RET 



*Cycle times without wait loops 



DLY50 TIMING ANALYSIS: 



CYCLES 



INITIAL (2x1) 2 
1 x (67 x 2 + 1 x 4) 138 
97 x (255 x 2 + 1 x 4)49858 
Terminating (2) 2 



TOTAL 



50000 Equivalent to 50.000 ms @ 1 jus per cycle 



39 



http://www.national.com 



PIC16C5X BENCHMARK #8— SWITCH ACTIVATED FIVE SECOND LED 

This benchmark samples a switch input to activate a five second output for turning on an LED. The switch is debounced with 
a 50 ms delay both on opening and closure. Once activated, the switch will turn on an LED output for five seconds and then 
turn it off, regardless of whether or not the switch is still activated. Once the switch is turned off, the procedure is repeated. 
Both the switch input and the LED output are low true. 



Register File F6 
Reg File F21 
Reg File F22 
Reg File F23 

Output data to W 

Set upper 4 bits of port for low true output 

Tri-state control to W 

Tri-state lower 4 bits of port for input 

Sample input switch (low true) 

Wait for Switch ON 

Debounce 50 ms 

Turn on LED (low true) 

Five sec count to W 

Call DLY50 50 ms subroutine 100 times 

to get 5 second LED ON time 
Decrement CNTR1 and test for zero result 
Loop back until count finished 
Turn off LED (low true) 
Sample input switch (low true) 
Wait for Switch OFF 
Debounce 50 ms 
Repeat procedure (wait for Switch ON) 

Outer loop count to W 

Set up outer loop count 

Inner loop count to W 

Set up inner loop count 

Decrement inner loop count and test for zero 

Loop back until inner count finished 

Decrement outer loop count and test for zero 

Loop back until outer count finished 

Return from subroutine 



27 WORDS (Equivalent to 40 1/2 BYTES) 
37 CYCLES* 

CYCLES 

4 

665 
99330 

1 

100000 Equivalent to 50.000 ms @ 0.5 |as/cycle 





PORTB 


EQU 6 






CNTR 


EQU 21 






CNTR2 


EQU 22 






CNTR3 


EQU 23 




LED5: 


MOVLW 


FOH 


1/1 




MOVWF 


PORTB 


1/1 




MOVLW 


OFH 


1/1 




TRIS 


PORTB 


1/1 


WSWON : 


BIFSC 


PORTB, 


1/1 




GOTO 


WSWON 


1/2 


SWON: 


CALL 


DLY50 


1/2 




BCF 


PORTB, 7 


1/1 




MOVLW 


100 


1/1 




MOVWF 


CNTR 


1/1 


SEC5: 


CALL 


DLY50 


1/2 




DECFSZ 


CNTR 


1/1 




GOTO 


SEC5 


1/2 




BSF 


PORTB, 7 


1/1 


WSWOFF : 


BTFSS 


PORTB, 


1/1 




GOTO 


WSWOFF 


1/2 


SWOFF: 


CALL 


DLY50 


1/2 




GOTO 


WSWON 


1/2 


DLY50 : 


MOVLW 


130 


1/1 




MOVWF 


CNTR2 


1/1 




MOVLW 


221 


1/1 




MOVWF 


CNTR3 


1/1 


LOOP: 


DECFSZ 


CNTR3 


1/1 




GOTO 


LOOP 


1/2 




DECFSZ 


CNTR2 


1/1 




GOTO 


LOOP 


1/2 




RETLW 





1/2 



*Cycle times without wait loops 

DLY50 TIMING ANALYSIS: 

Initial (4 x 1) 
1 x (220 x 3 + 1 x 5) 
129 x (255 x 3 + 1 x 5) 
Terminating (2 - 1) 

TOTAL 
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6.0 SUMMARY OF RESULTS (Both Positive ( + ) and Negative (-) 

Benchmark #1 - BLOCK TRANSFER 

COP8 Two indirect data pointers ( + ); IFBNE instruction ( + ) 

M68HC05 Indexed addressing ( + ); Decrementing index to zero ( + ) 

80C51 Two indirect data pointers ( + ) 

PIC16C5X Only one indirect data pointer (-) 
Benchmark #2 - BINARY ADDITION 

COP8 Two indirect data pointers ( + ); IFBNE instruction ( + ) 

M68HC05 Indexed addressing with offset ( + ) 

80C51 Two indirect data pointers ( + ) 

PIC16C5X Only one indirect data pointer (-) 
Benchmark #3 - BCD SUBTRACTION 

COP8 Two indirect data pointers ( + ); IFBNE instruction ( + ); DCOR (decimal correct) instruction ( + ) 

M68HC05 Indexed addressing with offset ( + ); No BCD correction instruction (-) 

80C51 Two indirect data pointers ( + ); DA instruction (BCD correction) only for add, not subtract (-) 

PIC16C5X Only one indirect data pointer (-); No BCD correction instruction (-) 
Benchmark #4 - TABLE SEARCH 

COP8 LAID instruction ( + ); Lack of IFNE comparison instruction (-); RETSK instruction ( + ) 

M68HC05 Indexed addressing ( + ); Lack of loading immediate values directly to memory ( — ) 

80C51 16-bit DPTR (data pointer ( + ); MOV A @A + DPTR instruction ( + ); CJNE instruction combine comparison 

and branch ( + ) 

PIC16C5X RETLW instruction ( + ) 
Benchmark #5 - INPUT/OUTPUT MANIPULATION 

COP8 I/O Port Configuration registers for individual bit control ( + ) 

M68HC05 I/O Port Data Direction reg's for individual bit control ( + ); No SWAP to reverse nibbles (-) 

80C51 Distinction between tristating for input versus output of 1 (?— ); No configuration register (— ) 

PIC16C5X TRIS (tristate instruction) ( + ); No configuration register (-) 
Benchmark #6 - SERIAL INPUT/OUTPUT WITH OFFSET TABLE 

COP8 Microwire/Plus for serial I/O ( + ); LAID instruction (+) 

M68HC05 SPI (Serial Peripheral Interface) for serial I/O ( + ) 

80C51 Serial I/O with 8-bit shift register mode cannot input and output simultaneously (-); MOV A, @A + DPTR 

instruction ( + ) 

PIC16C5X Lack of any dedicated serial I/O (-); RETLW instruction ( + ) 
Benchmark #7 - TIMEKEEPING 

COP8 Timer autoreload with interrupt very flexible ( + ) 

M68HC05 Timer Output Compare interrupt structuring for real time cumbersome and confusing ( — ) 

80C51 Lack of 16-bit timer autoreload (-) 

PIC16C5X OPTION instruction to select timer prescaler ( + ); Lack of timer interrupt (-); No timer autoreload (-) 
Benchmark #8 - SWITCH ACTIVATED 5 SEC LED 

COP8 Lack of bit testing for bit clear (-) 

M68HC05 Bit testing for both set and clear ( + ); Versatility of index register ( + ) 

80C51 DJNZ instruction combines testing and branch ( + ); JB and JNB instructions combine bit testing and 

branch ( + ) 

PIC1 6C5X Bit testing for both set and clear ( + ) 



TABLE I. The Benchmark Results: Code Size Efficiency in Bytes 




BENCHMARK BYTES 


National 
COP8 


Motorola 
68HC05 


Intel 
80C51 


Microchip 
PIC16C5X 


Five Byte Block Move 


7 


9 


11 


21 


Four Byte Binary Addition 


10 


13 


14 


28 i/ 2 


Four Byte BCD Subtraction 


25 


54 


39 


82 y 2 


Three Byte Table Search 


42 


40 


29 


51 


Input/Output Manipulation 


26 


42 


33 


37 Vs 


Serial I/O with Offset Table 


31 


31 


35 


45 


Timekeeping 


60 


87 


66 


81 


Switch Activated 5s LED 


37 


47 


34 


40 Vs 


TOTAL 


238 


323 


261 


387 


RATIO 


1 


1.36 


1.10 


1.63 
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TABLE II. The Benchmark Results: Code Execution Time Efficiency in Cycles and Microseconds 



BENCHMARK CYCLE/jiS 


National 
COP8 


Motorola 
68HC05 


Intel 
80C51 


Microchip 
PIC16C5X 


Five Byte Block Move 


47/47 


76/38 


32/32 


62/31 


Four Byte Binary Addition 


48/48 


85/42.5 


33/33 


62/31 


Four Byte BCD Subtraction 


97/97 


567/283.5 


91/91 


274/137 


Three Byte Table Search (Note 2) 


77/77 


80/40 


30/30 


52/26 


Input/Output Manipulation (Note 3) 


24/24 


53/26.5 


17/17 


21/10.5 


Serial I/O with Offset Table (Note 1) 


36/36 


54/27 


52/52 


214/107 


Timekeeping (Note 5) 


80/80 


218/109 


49/49 


69/34.5 


Switch Activated 5s LED (Note 4) 


76/76 


88/44 


27/27 


37/18.5 


TOTAL 


485/485 


1221/610.5 


331/331 


791/395.5 


RATIO 


1/1 


2.53/1.26 


0.68/0.68 


1.63/0.82 



Note 1: Couplet (address, data) loop time: address and data input, updated data output. 

Note 2: First search iteration falls with 1 st byte mismatch, second search iteration successful. 

Note 3: Case where P1 < P2. 

Note 4: Cycle times without wait loops. 

Note 5: Case where 12:59:59 is advancing to 1:00:00. 

7.0 CONCLUSIONS 

This report provides a detailed study of the instruction sets of popular 8-bit single chip microcontrollers. Eight benchmark 

programs, demonstrating data movement, arithmetic operations, I/O manipulation, and timekeeping, were coded for four 

current microcontrollers. 

In terms of byte efficiency, the COP8 from National, uses 8% less program space than its nearest competitor, the Intel 

80C51. The COP8 uses, 26% less program space than Motorola 68HC05, 39% less program space than the Microchip 

PIC16C5X. Code efficiency is important because it enables designers to pack more on-chip functionality into less program 

memory space. Selecting a microcontroller with smaller program memory size translates into lower system costs, and the 

added security of knowing that more code can be packed into the microcontroller. 

In terms of code execution, Intel's 80C51 is the fastest, while COP8 executes these benchmark routines faster than 

Motorola 68HC05 and Microchip PIC16C5X. 



LIFE SUPPORT POLICY 

NATIONAL'S PRODUCTS ARE NOT AUTHORIZED FOR USE AS CRITICAL COMPONENTS IN LIFE SUPPORT 
DEVICES OR SYSTEMS WITHOUT THE EXPRESS WRITTEN APPROVAL OF THE PRESIDENT OF NATIONAL 
SEMICONDUCTOR CORPORATION. As used herein: 

1. Life support devices or systems are devices or 2. A critical component is any component of a life 



systems which, (a) are intended for surgical implant 
into the body, or (b) support or sustain life, and whose 
failure to perform, when properly used in accordance 
with instructions for use provided in the labeling, can 
be reasonably expected to result in a significant injury 
to the user. 



support device or system whose failure to perform can 
be reasonably expected to cause the failure of the life 
support device or system, or to affect its safety or 
effectiveness. 
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